首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于经验风险的中心文本分类算法
引用本文:周晓堂,欧阳继红,李熙铭.基于经验风险的中心文本分类算法[J].吉林大学学报(理学版),2013,51(5):876-880.
作者姓名:周晓堂  欧阳继红  李熙铭
作者单位:吉林大学 计算机科学与技术学院, 长春 130012
基金项目:国家自然科学基金(批准号:61170092;61133011;61272208;61103091;61202308)
摘    要:采用经验风险最小化归纳原则和梯度下降方法调整传统中心分类法的类别中心向量, 解决了传统中心分类法因忽略训练集文本权值因素而导致的类别中心向量表达能力较差问题, 得到了与支持向量机分类性能基本一致的一种改进的中心分类法. 实验结果表明, 该方法是提高中心分类法分类性能的一种有效方法.

关 键 词:文本分类  中心分类法  经验风险最小化  
收稿时间:2012-12-19

Centroid Classifier Based on Empirical Risk for Text Categorization
ZHOU Xiao-tang;OUYANG Ji-hong;LI Xi-ming.Centroid Classifier Based on Empirical Risk for Text Categorization[J].Journal of Jilin University: Sci Ed,2013,51(5):876-880.
Authors:ZHOU Xiao-tang;OUYANG Ji-hong;LI Xi-ming
Institution:College of Computer Science and Technology, Jilin University, Changchun 130012, China
Abstract:Empirical risk minimization inductive principle and gradient descent method were used to fix class centroid vectors in traditional centroid based text classification algorithms so as to improve the poor expression ability of class centroid vectors in traditional centroid based text classification algorithm caused by ignoring the weighting factors of training texts. Then, an improved centroid based text classification algorithm was obtained, theperformance of which is as well as those of support vector machines. Experimental results show that the method adopted in this article is an effective mean to improve the performance of traditional centroid based text classification algorithms.
Keywords:text classification  centroid based text classification algorithms  empirical risk minimization
本文献已被 CNKI 等数据库收录!
点击此处可从《吉林大学学报(理学版)》浏览原始摘要信息
点击此处可从《吉林大学学报(理学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号