首页 | 本学科首页   官方微博 | 高级检索  
     检索      

以相关性确定条件属性的概化决策树
引用本文:刘健,陈俊杰.以相关性确定条件属性的概化决策树[J].太原理工大学学报,2006(Z1).
作者姓名:刘健  陈俊杰
作者单位:太原理工大学计算机与软件学院 山西太原030024
基金项目:教育部科学技术研究重点项目(03020),山西省自然科学基金资助项目(20031038)
摘    要:在介绍了一些典型决策树分类算法的基础上,研究了一种基于相关性分析的决策树分类器。其主要思想是通过属性相关性来压缩训练集的大小并在建立决策树过程中采用此度量值来确定划分条件属性的顺序,通过阈值设定和处理简化了决策树的剪枝和优化过程,提高了处理的效率和规模。文章详细描述了算法的执行过程以及正确性证明和时间复杂性分析。

关 键 词:决策树  分类  相关性分析  效率  规模

A Generalized Decision Tree Using Relevance Analysis to Evaluate Condition Attributes
LIU Jian,CHEN Jun-jie.A Generalized Decision Tree Using Relevance Analysis to Evaluate Condition Attributes[J].Journal of Taiyuan University of Technology,2006(Z1).
Authors:LIU Jian  CHEN Jun-jie
Abstract:Efficiency and scalability are fundamental issues concerning data mining in large databases.The decision tree is an important classifier in data mining.In this paper a decision tree classifier based on relevance analysis is proposed after discussing traditional algorithms.The main idea is to compact the training data and evaluate condition attributes with correlations,and made pruning and optimization process simplified in order to get high accuracy and fast classifying speed,which leads to efficient,high-quality,multiple-level classification of large amounts of data.The accuracy of the algorithm was proven and the complexity of time was analyzed in the paper,too.
Keywords:decision tree  classification  relevance analysis  efficiency  scalability
本文献已被 CNKI 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号