首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于粗糙集和决策树的数据挖掘方法
引用本文:吴成东,许可,韩中华,裴涛.基于粗糙集和决策树的数据挖掘方法[J].东北大学学报(自然科学版),2006,27(5):481-484.
作者姓名:吴成东  许可  韩中华  裴涛
作者单位:1. 东北大学信息科学与工程学院,辽宁沈阳,110004
2. 沈阳建筑大学信息与控制工程学院,辽宁沈阳,110168
基金项目:科技部国际科技合作项目
摘    要:从粗糙集和决策树两种方法具有的优势互补性出发,提出了一种基于粗糙集和决策树相结合的数据挖掘新方法·以胶合板缺陷检测数据分析为应用对象,利用粗糙集理论对胶合板数据库中的特征信息进行缺陷识别·利用谱系聚类重心距离法对数据进行离散化处理,采用粗糙集进行属性约简,得到低维样本数据,最后用决策树方法产生决策规则·实验证明,这种数据挖掘方法保留了原始数据的内部特点,加快了获取知识的进程,提高了模型的分类准确率,增强了规则的可解释性,取得了满意的研究结果·

关 键 词:粗糙集  决策树  数据离散化  数据挖掘  谱系聚类  属性约简  
文章编号:1005-3026(2006)05-0481-04
收稿时间:2005-06-22
修稿时间:2005年6月22日

Approach to Data Mining Based on Rough Sets and Decision Tree
WU Cheng-dong,XU Ke,HAN Zhong-hua,PEI Tao.Approach to Data Mining Based on Rough Sets and Decision Tree[J].Journal of Northeastern University(Natural Science),2006,27(5):481-484.
Authors:WU Cheng-dong  XU Ke  HAN Zhong-hua  PEI Tao
Institution:(1) School of Information Science and Engineering, Northeastern University, Shenyang 110004, China; (2) School of Information and Control Engineering, Shenyang Jianzhu University, Shenyang 110168, China
Abstract:Rough sets and decision tree have complementary characteristics. A new approach to data mining is thus proposed combining both advantages. Taking the detected data of plywood defects as example, the defects are recognized as follow using eigen information in the database of plywood on the basis of rough sets theory. Decentralizes the data in the database by the algorithm of center-of-gravity distance of pedigree cluster, then reduces the conditional attribute by use of rough sets to obtain the low dimensional sample data. Decision rules are finally obtained by decision tree. The experimental result shows that, in this way, the original characteristics of data remained unchanged, and the knowledge acquisition process become speedier so as to improve the classification accuracy of model and interpretability of rules. Comparing with other the methods, such as rough sets or precision-varied rough sets, the method is proved more satisfactory.
Keywords:rough sets  decision tree  data decentralization  data mining  pedigree cluster  attribute reduction
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《东北大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《东北大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号