首页 | 本学科首页   官方微博 | 高级检索  
     检索      

概念属性扩展的短文本聚类算法
引用本文:白秋产,金春霞.概念属性扩展的短文本聚类算法[J].长春师范学院学报,2011(10):29-33.
作者姓名:白秋产  金春霞
作者单位:淮阴工学院电子与电气工程学院;淮阴工学院计算机工程学院;
摘    要:为了解决短文本因特征关键词稀疏而导致文本向量概念表达不够准确的问题,本文提出概念属性扩展特征关键词短文本聚类算法——STCBCFE(Short Text Clustering Based on Concept Feature Ex-pansion)。该算法通过HowNet的概念属性扩展特征关键词,以此增加文本语义特征和反映文本主题的特征关键词数量,进而提高短文本相似性;将其应用于短文本聚类,能够提高短文本的聚类效果。实验结果表明,该算法在短文本聚类的查准率和查全率上都得到了较大的提高。

关 键 词:短文本  扩展特征关键词  知网  文本聚类  K-means

Short Text Clustering Algorithm Based on Concept Feature Expansion
BAI Qiu-chan,JIN Chun-xia.Short Text Clustering Algorithm Based on Concept Feature Expansion[J].Journal of Changchun Teachers College,2011(10):29-33.
Authors:BAI Qiu-chan  JIN Chun-xia
Institution:BAI Qiu-chan1,JIN Chun-xia2(1.Faculty of Electronic and Electrical Engineering,Huaiyin Institute of Technology,Huaian 223003,China,2.Faculty of Computer Engineering,China)
Abstract:In order to solve the inaccurate concept expression problem of text vector which is caused by sparse feature keywords in short text,this paper proposes short text clustering algorithm based on concept feature expansion.The algorithm expands feature keywords through adopting HowNet's concept attributes.It not only adds the semantic features of the text and the number of feature keywords which reflect text topic,but also improves the similarity of the short text.It is used in short text clustering to increase...
Keywords:short text  feature keyword expansion  HowNet  text clustering  K-means algorithm  
本文献已被 CNKI 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号