首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于IN算法的剪枝优化算法
引用本文:程世辉,龙金辉.基于IN算法的剪枝优化算法[J].信阳师范学院学报(自然科学版),2007,20(2):237-240.
作者姓名:程世辉  龙金辉
作者单位:1. 河南教育学院,信息技术系,河南,郑州,450014
2. 河南机电学校,河南,郑州450002
摘    要:提出一种基于IN算法构造分类器的剪枝优化算法C IN.针对IN算法利用对数似然比统计量进行假设检验存在的统计意义不明确的问题,本文算法在给定层每一节点引入了样本数阈值和属性值阈值的计算,从而保证检验的有效性.给出了算法的理论依据,并且推导出了对数似然比统计量计算公式成立条件.实验表明,该算法能够消减数据维数并且可以从大规模数据集中提取简明的规则.

关 键 词:  互信息  对数似然比统计量
文章编号:1003-0972(2007)02-0237-04
收稿时间:2006-10-12
修稿时间:2007-01-16

Pruning Optimization Algorithm Based on IN Algorithm
CHENG Shi-hui,LONG Jin-hui.Pruning Optimization Algorithm Based on IN Algorithm[J].Journal of Xinyang Teachers College(Natural Science Edition),2007,20(2):237-240.
Authors:CHENG Shi-hui  LONG Jin-hui
Abstract:This paper proposed a novel algorithm termed as CIN for classification based on IN(information-theoretic network)algorithm.Aim at ignorance of statistical significance in statistical hypotesis testing by means of the log likelihood ratio in IN algorithm,the CIN algorithm in troduces the threshold of the number of records in each node of given layer so as to guarantee reliability of testing.At the same time,the theoretic basis of the algorithm is given and precondition for the validity of the log likelihood ratio is derived.Empirical results show that the data dimensionality can be reduced and compact rules can be extracted with the CIN algorithm.
Keywords:entropy  mutual information  the log likelihood ratio statistic
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号