首页 | 本学科首页   官方微博 | 高级检索  
     检索      

Mining incomplete data—A rough set approach
作者姓名:GRZYMALA  BUSSE  Jerzy  W
作者单位:GRZYMALA-BUSSE Jerzy W(University of Kansas, Lawrence, KS 66045, USA)
摘    要:Many reallife data sets are incomplete, or in different words, are affected by missing attribute values.Three interpretations of missing attribute values are discussed in the paper: lost values (erased values), attributeconcept values(such a value may be replaced by any value from the attribute domain restricted to the concept), and “do not care” conditions (a missing attribute value may be replaced by any value from the attribute domain).For incomplete data sets three definitions of lower and upper approximations are discussed.Experiments were conducted on six typical data sets with missing attribute values, using three different interpretations of missing attribute values and the same definition of concept lower and upper approximations.The conclusion is that the best approach to missing attribute values is the lost value type.

关 键 词:rough  set  theory    incomplete  data  sets    missing  attribute  values    lost  values    attributeconcept  values    “do  not  care”  conditions    the  MLEM2  algorithm  of  rule  induction.
文章编号:1673-825X(2008)03-0282-09
收稿时间:2008/3/25 0:00:00
修稿时间:2008年3月25日

Mining incomplete data-A rough set approach
GRZYMALA BUSSE Jerzy W.Mining incomplete data-A rough set approach[J].Journal of Chongqing University of Posts and Telecommunications,2008,20(3):282-290.
Authors:GRZYMALA BUSSE Jerzy W
Institution:University of Kansas, Lawrence, KS 66045, USA
Abstract:Many real-life data sets are incomplete, or in different words, are affected by missing attribute values.Three interpretations of missing attribute values are discussed in the paper., lost values (erased values), attribute-concept values(such a value may be replaced by any value from the attribute domain restricted to the concept), and"do not care" conditions (a missing attribute value may be replaced by any value from the attribute domain). For in- complete data sets three definitions of lower and upper approximations are discussed. Experiments were conducted on six typical data sets with missing attribute values, using three different interpretations of missing attribute values and the same definition of concept lower and upper approximations. The conclusion is that the best approach to miss- ing attribute values is the lost value type.
Keywords:rough set theory  incomplete data sets  missing attribute values  lost values  attribute-concept values  "do not care" conditions  the MLEM2 algorithm of rule induction  
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《重庆邮电大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《重庆邮电大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号