
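Improvement of the text classification algorithm based on maximum entropy
Cite this article: HE Xing-shi, YANG Cheng-cheng. Improvement of the text classification algorithm based on maximum entropy [J]. Journal of Xi'an Shiyou University (Natural Science Edition), 2009, 24(6).
Authors: HE Xing-shi, YANG Cheng-cheng
Affiliation: School of Science, Xi'an Polytechnic University, Xi'an, Shaanxi 710048, China
Abstract: The training results of the text classification algorithm based on the maximum entropy model vary considerably across different test documents. The Boosting mechanism is used to improve the maximum entropy-based classification algorithm in order to increase its stability. Experimental results show that the improved method effectively increases the stability of the maximum entropy-based classification algorithm and also yields some improvement in classification accuracy.

Keywords: text classification algorithm; maximum entropy model; Boosting algorithm; stability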

Improvement of text categorization algorithm based on maximum entropy
HE Xing-shi, YANG Cheng-cheng. Improvement of text categorization algorithm based on maximum entropy [J]. Journal of Xi'an Shiyou University, 2009, 24(6).
Authors:HE Xing-shi  YANG Cheng-cheng
Abstract: The text categorization algorithm based on the maximum entropy model is an effective method that performs better than typical text categorization algorithms such as Bayes, KNN, and SVM. However, its training results differ considerably across test documents; that is, its stability is poor. To address this, the algorithm is improved using a boosting mechanism in order to increase its stability. Experimental results show that the improved method is effective in increasing both the stability and the classification accuracy of the text categorization algorithm based on the maximum entropy model.
Keywords:text categorization algorithm  maximum entropy model  boosting mechanism  stability
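
The abstract does not give the details of how boosting is combined with the maximum entropy classifier, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a multiclass AdaBoost variant (SAMME) wrapped around a weighted multinomial logistic regression, which is one common form of maximum entropy model, fitted by plain gradient ascent. All function names, hyperparameters, and the toy bag-of-words data are hypothetical and chosen purely for illustration.

```python
import math

def predict_proba(W, x):
    """Softmax over linear scores: the maximum entropy model's class distribution."""
    scores = [sum(wj * xj for wj, xj in zip(row, x)) for row in W]
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]

def predict(W, x):
    """Most probable class under one maximum entropy model."""
    probs = predict_proba(W, x)
    return max(range(len(probs)), key=probs.__getitem__)

def train_maxent(X, y, weights, n_classes, epochs=200, lr=0.1):
    """Fit a weighted maxent model by batch gradient ascent (assumed optimizer)."""
    n_features = len(X[0])
    W = [[0.0] * n_features for _ in range(n_classes)]
    for _ in range(epochs):
        grad = [[0.0] * n_features for _ in range(n_classes)]
        for x, label, w in zip(X, y, weights):
            probs = predict_proba(W, x)
            for c in range(n_classes):
                err = (1.0 if c == label else 0.0) - probs[c]
                for j, xj in enumerate(x):
                    grad[c][j] += w * err * xj  # document weight scales its gradient
        for c in range(n_classes):
            for j in range(n_features):
                W[c][j] += lr * grad[c][j]
    return W

def boost_maxent(X, y, n_classes, rounds=5):
    """SAMME-style boosting (assumed variant) over maxent base learners."""
    n = len(X)
    weights = [1.0 / n] * n
    ensemble = []
    for _ in range(rounds):
        W = train_maxent(X, y, weights, n_classes)
        preds = [predict(W, x) for x in X]
        err = sum(w for w, p, t in zip(weights, preds, y) if p != t)
        if err >= 1.0 - 1.0 / n_classes:
            break  # base learner is no better than chance
        clamped = max(err, 1e-10)  # avoid division by zero for a perfect learner
        alpha = math.log((1.0 - clamped) / clamped) + math.log(n_classes - 1.0)
        ensemble.append((alpha, W))
        if err == 0.0:
            break  # nothing left to reweight
        # Increase the weight of misclassified documents, then renormalize.
        weights = [w * math.exp(alpha) if p != t else w
                   for w, p, t in zip(weights, preds, y)]
        total = sum(weights)
        weights = [w / total for w in weights]
    return ensemble

def predict_ensemble(ensemble, x, n_classes):
    """Weighted vote of all base learners."""
    votes = [0.0] * n_classes
    for alpha, W in ensemble:
        votes[predict(W, x)] += alpha
    return max(range(n_classes), key=votes.__getitem__)

# Hypothetical toy data: [bias, "goal", "match", "stock", "market"] word counts.
X = [[1, 2, 1, 0, 0], [1, 1, 2, 0, 0], [1, 3, 0, 0, 1],   # class 0 (sports)
     [1, 0, 0, 2, 1], [1, 0, 1, 1, 2], [1, 0, 0, 1, 3]]   # class 1 (finance)
y = [0, 0, 0, 1, 1, 1]

ensemble = boost_maxent(X, y, n_classes=2)
print([predict_ensemble(ensemble, x, 2) for x in X])
```

On a tiny, cleanly separable set like this one the loop may stop after the first round, because the first base learner already fits the training documents. The reweighting step matters on noisier corpora, where later rounds concentrate on the documents earlier models misclassified; the stability gain reported in the abstract would have to be checked on real data.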
This article is indexed in Wanfang Data and other databases.