首页 | 本学科首页   官方微博 | 高级检索  
     检索      

蚁群-遗传融合的文本聚类算法
引用本文:张云,冯博琴,麻首强,刘连梦.蚁群-遗传融合的文本聚类算法[J].西安交通大学学报,2007,41(10):1146-1150.
作者姓名:张云  冯博琴  麻首强  刘连梦
作者单位:西安交通大学电子与信息工程学院,710049,西安
基金项目:国家高技术研究发展计划(863计划);西安交通大学教育振兴行动计划资助项目
摘    要:针对蚁群算法容易出现停滞现象而不能对解空间进行全面搜索的问题,提出了一种蚁群-遗传融合的文本聚类算法.该算法将影响蚁群算法性能的4个参数作为遗传算法中的染色体进行编码,基于此又设计出相应的适应度函数以及选择交叉变异算子,通过多次迭代找出最优的参数组合,并将其应用到文本聚类问题上.经与经典的k均值聚类算法、基本的蚁群聚类算法的仿真比较,结果表明所提出算法的聚类效果更好,在3个测试集上的F度量值要比k均值聚类算法分别提高5.69%、48.60%、69.60%,所以更适合于处理较大规模的数据集.

关 键 词:蚁群算法  遗传算法  融合  文本聚类
文章编号:0253-987X(2007)10-1146-05
修稿时间:2007-02-01

Text Clustering Based on Fusion of Ant Colony and Genetic Algorithms
Zhang Yun,Feng Boqin,Ma Shouqiang,Liu Lianmeng.Text Clustering Based on Fusion of Ant Colony and Genetic Algorithms[J].Journal of Xi'an Jiaotong University,2007,41(10):1146-1150.
Authors:Zhang Yun  Feng Boqin  Ma Shouqiang  Liu Lianmeng
Abstract:Focusing on the problem that the ant colony algorithm gets into stagnation easily and cannot fully search in solution space,a text clustering approach based on fusion of ant colony and genetic algorithm is proposed.The four parameters which influence the performance of ant colony algorithm are encoded as chromosomes,thereby the fitness function,selection,crossover and mutation operator are designed to find the combination of optimal parameters through a number of iterations,and then it is applied to text clustering.The simulation results show that compared with the classical k-means clustering and the basic ant colony clustering algorithm,the proposed algorithm has better performance and the value of F-measure is enhanced by 5.69%,48.60% and 69.60% respectively in 3 test data sets.Therefore it is more suitable for processing larger datasets.
Keywords:ant colony algorithm  genetic algorithm  fusion  text clustering
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号