首页 | 本学科首页   官方微博 | 高级检索  
     

基于改进蚁群算法的高精度文本特征选择方法
引用本文:李凯齐,刁兴春,曹建军,李峰. 基于改进蚁群算法的高精度文本特征选择方法[J]. 解放军理工大学学报(自然科学版), 2010, 0(6): 634-639
作者姓名:李凯齐  刁兴春  曹建军  李峰
作者单位:解放军理工大学指挥自动化学院;总参第63研究所;总参第63研究所;总参第63研究所;总参通信部驻七一四厂军事代表室
基金项目:中国博士后科学基金资助项目(20090461425);江苏省博士后科研资助计划项目(0901014B)
摘    要:为了反映特征子集对分类结果的整体影响,去除噪声特征项,提出了一种基于改进蚁群算法的高精度文本特征选择方法。建立了特征选择数学模型,实现了特征选择过程与分类器分类过程间的直接关联;设计了特征优选与特征精选相结合的模型求解方案,降低了模型求解过程中的计算复杂度;提出了基于等效路径增强和局部搜索更新相结合的改进蚁群算法,提高了解的质量和稳定性。实验结果表明,与现有文本特征选择方法相比,该方法能大幅提升分类精度。

关 键 词:特征子集  蚁群优化  文本分类  特征选择  高精度

High precision method for text feature selection based onimproved ant colony optimization algorithm
LI Kai-qi,DIAO Xing-chun,CAO Jian-jun and LI Feng. High precision method for text feature selection based onimproved ant colony optimization algorithm[J]. Journal of PLA University of Science and Technology(Natural Science Edition), 2010, 0(6): 634-639
Authors:LI Kai-qi  DIAO Xing-chun  CAO Jian-jun  LI Feng
Affiliation:Institute of Command Automation,PLA Univ.of Sci.& Tech.,Nanjing 210007, China;The 63rd Research Institute,PLA General Staff Headquarters,Nanjing 210007,China;The 63rd Research Institute,PLA General Staff Headquarters,Nanjing 210007,China;The 63rd Research Institute,PLA General Staff Headquarters,Nanjing 210007,China;Representative Office of the 714 Factory,Communication Department of PLA General Staff Headquarters,Nanjing 210002,China
Abstract:To reflect the overall impact of feature subset on the classificat ion resul t and remov e the noisefeatur es, a new high-precision method was proposed for text featur e select ion based on the improv ed antco lony opt imization alg orithm. A mathematical model for feature select ion w as established to realize thedirect correlat io n betw een the feature select ion pro cess and the classif ier classificat ion pro cess. A newmodel-solv ing metho d composed of the opt imized feature select ion step and the refined feature selectionstep w as designed and the computat ional complex ity in the pro cess of model solving reduced. A newimproved ant colo ny opt imizat ion algor ithm based o n equivalent routes and local sear ch w hich improv ed thequality and stability of the problem solut io n was proposed. The experiment result s on the tw o dataset sshow the superiority of the pro posed method ov er the current feature select ion metho ds in terms ofclassif icat io n accuracy.
Keywords:feature subset    ant colony optimizat ion   tex t categor ization   feature select ion   high precision
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《解放军理工大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《解放军理工大学学报(自然科学版)》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号