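A preprocessing method for continuous attributes in rough sets
Cite this article: E Xu, GAO Xue-dong, XIE Lin-quan, HE Hai-jun. A preprocessing method for continuous attributes in rough sets[J]. Journal of Liaoning Technical University (Natural Science Edition), 2005, 24(3): 400-403.
Authors: E Xu  GAO Xue-dong  XIE Lin-quan  HE Hai-jun
Affiliations: 1. School of Management, University of Science and Technology Beijing, Beijing 100083; Department of Computer Science, Liaoning Institute of Technology, Jinzhou, Liaoning 121001
2. School of Management, University of Science and Technology Beijing, Beijing 100083
Funding: Supported by the Scientific Research Fund for Higher Education Institutions of the Inner Mongolia Autonomous Region (NJ.02112)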
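Abstract: Continuous attributes often need preprocessing in data mining. To address this, rough set theory is applied to study the incompleteness and discretization problems of continuous attributes, and a preprocessing method for continuous attributes is proposed. Incomplete data are filled in based on the correspondence between condition attributes and decision attributes. Drawing on the concept of the demarcation interval and on the meaning and essential characteristics of continuous-attribute discretization, an addition rule for demarcation intervals is defined. Demarcation-interval operations are then applied to the filled information table, with classification quality as the iteration constraint of the discretization process, to discretize the continuous attributes in the table. The algorithm was implemented in C++ and run on a numerical example and on test databases. The experimental results show that the algorithm is effective and feasible.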
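The abstract does not spell out the filling rule, so the following is only a minimal sketch of one way to fill missing values using the condition-decision correspondence. It assumes that a missing continuous value is replaced by the mean of that attribute over objects with the same decision value. The function name `fill_by_decision_class`, the use of `None` for missing values, and the sample table are all hypothetical and not taken from the paper.

```python
from collections import defaultdict

def fill_by_decision_class(rows, decision_index):
    """Fill None entries in condition attributes with the mean of that
    attribute over rows that share the same decision value.
    This class-conditional mean is an assumed stand-in, not the paper's rule."""
    sums = defaultdict(float)
    counts = defaultdict(int)
    # First pass: accumulate per-(decision, attribute) sums of known values.
    for row in rows:
        d = row[decision_index]
        for j, v in enumerate(row):
            if j != decision_index and v is not None:
                sums[(d, j)] += v
                counts[(d, j)] += 1
    # Second pass: build a filled copy, leaving the input untouched.
    filled = []
    for row in rows:
        d = row[decision_index]
        new_row = list(row)
        for j, v in enumerate(row):
            if j != decision_index and v is None and counts[(d, j)] > 0:
                new_row[j] = sums[(d, j)] / counts[(d, j)]
        filled.append(new_row)
    return filled

# Hypothetical information table: two condition attributes, one decision.
table = [
    [1.0, 5.0, "yes"],
    [3.0, None, "yes"],
    [None, 7.0, "yes"],
    [8.0, 2.0, "no"],
    [None, 4.0, "no"],
]
filled = fill_by_decision_class(table, decision_index=2)
```

A value stays missing if no object with the same decision value has that attribute observed; a real implementation would need a fallback for that case.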

Keywords: data mining  rough set  preprocessing  demarcation interval  discretization
Article ID: 1008-0562(2005)03-0400-04
Revised: April 20, 2004

An algorithm for preprocessing continuous attributes in rough sets
E Xu,GAO Xue-dong,XIE Lin-quan,HE Hai-jun.An algorithm for preprocessing continuous attributes in rough sets[J].Journal of Liaoning Technical University (Natural Science Edition),2005,24(3):400-403.
Authors:E Xu  GAO Xue-dong  XIE Lin-quan  HE Hai-jun
Abstract: In data mining, continuous attributes often need to be preprocessed. Using rough set theory, the incompleteness and discretization problems of continuous attributes are studied, and a new algorithm for preprocessing continuous attributes is proposed. Incomplete data are filled in according to the correspondence between condition and decision attributes. Based on the concept of the demarcation interval and the essential characteristics of discretization, the paper defines an addition rule for demarcation intervals. Applying this rule to each attribute, with classification quality as the iteration constraint, discretizes the continuous attributes. A numerical example and experiments were run with a C++ program, and the results indicate that the method is effective for preprocessing continuous attributes.
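The abstract does not give the addition rule for demarcation intervals, so the sketch below substitutes a simple greedy stand-in: start with a cut between every pair of adjacent distinct values and drop a cut (merging its two neighbouring intervals) whenever classification quality does not fall. Classification quality is taken here to be the standard rough set measure, the fraction of objects in the positive region, |POS_C(D)| / |U|. All function names and the sample data are hypothetical.

```python
from collections import defaultdict

def classification_quality(cond_rows, decisions):
    """gamma = |POS_C(D)| / |U|: share of objects whose condition
    equivalence class contains a single decision value."""
    classes = defaultdict(list)
    for cond, d in zip(cond_rows, decisions):
        classes[tuple(cond)].append(d)
    consistent = sum(len(ds) for ds in classes.values() if len(set(ds)) == 1)
    return consistent / len(decisions)

def discretize(values, cuts):
    """Map each value to the index of the interval it falls in."""
    return [sum(v >= c for c in cuts) for v in values]

def merge_intervals(columns, decisions):
    """Greedy interval merging (an assumed stand-in for the paper's rule):
    remove a cut point only if classification quality is preserved."""
    # Initial cuts: midpoints between adjacent distinct values per attribute.
    cuts = []
    for col in columns:
        vals = sorted(set(col))
        cuts.append([(a + b) / 2 for a, b in zip(vals, vals[1:])])

    def quality(cs):
        coded = [discretize(col, c) for col, c in zip(columns, cs)]
        return classification_quality(list(zip(*coded)), decisions)

    target = quality(cuts)
    for j in range(len(columns)):
        for cut in list(cuts[j]):
            trial = [c if k != j else [x for x in c if x != cut]
                     for k, c in enumerate(cuts)]
            if quality(trial) >= target:  # iteration constraint
                cuts = trial
    return cuts

# Hypothetical table: two continuous condition attributes (as columns).
columns = [[1.0, 2.0, 3.0, 8.0, 9.0], [5.0, 6.0, 7.0, 2.0, 4.0]]
decisions = ["yes", "yes", "yes", "no", "no"]
cuts = merge_intervals(columns, decisions)
```

On this sample the first attribute ends up with no cuts and the second keeps a single cut at 4.5, which alone separates the two decision values. The greedy order matters: processing the attributes in a different order can leave a different but equally consistent set of cuts.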
Keywords:data mining  rough set  preprocessing  demarcation interval  discretization
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.