首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于粗糙集理论的两类离散化方法研究
引用本文:张文波.基于粗糙集理论的两类离散化方法研究[J].重庆邮电大学学报(自然科学版),2011,23(5):641-646.
作者姓名:张文波
作者单位:重庆邮电大学 计算机科学与技术研究所,重庆 400065
基金项目:教育部留学回国人员科研启动基金项目(教外司留[2007]1108号);重庆邮电大学科研基金(A2006-05)
摘    要:连续属性离散化是数据预处理的关键步骤之一,在实际应用中往往通过高效的启发式算法来计算离散化结果.对基于辅助矩阵和信息熵的两类启发式离散化算法进行实验研究,分别选取每类算法中的5种典型方法,通过系列实验,对两类算法的性能进行对比研究,结果表明:辅助矩阵类算法具有相对较高的样本识别能力,但算法复杂度较高,运行时间更长,较适...

关 键 词:粗糙集  离散化  辅助矩阵  信息熵
收稿时间:2009/11/6 0:00:00

Study on two kinds of discretization methods based on rough set theory
ZHANG Wen-bo.Study on two kinds of discretization methods based on rough set theory[J].Journal of Chongqing University of Posts and Telecommunications,2011,23(5):641-646.
Authors:ZHANG Wen-bo
Institution:Institute of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, P.R.China
Abstract:The discretization of continuous attributes is one of the key steps of Data Pretreatment. In practical application, continuous information systems are usually discretized by efficient heuristic algorithms. Herein, two kinds of heuristic data discretization approaches, i.e. discretization methods respectively based on auxiliary matrixes and information entropy, are thoroughly studied by simulation experiments. Five typical algorithms of each kind are realized and their performances are comprehensively compared by a series of experiments. The experimental results suggest that auxiliary matrix based algorithms are with higher capability, but more complex and time consuming, thus is appropriate for small-scaled continuous systems; and the characteristics of information entropy based algorithms are on the contrary.
Keywords:rough set  discretization  assistant matrix  information entropy
点击此处可从《重庆邮电大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《重庆邮电大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号