首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于概念分组的Web搜索结果聚类算法
引用本文:李红梅,丁振国,周水生,周利华.基于概念分组的Web搜索结果聚类算法[J].华南理工大学学报(自然科学版),2009,37(1):130-134.
作者姓名:李红梅  丁振国  周水生  周利华
作者单位:1. 西安电子科技大学,计算机学院,陕西,西安,710071
2. 西安电子科技大学,理学院,陕西,西安,710071
摘    要:为了便于用户浏览搜索引擎返回的搜索结果,快速有效地定位有价值的Web文档,提出了基于概念分组的Web搜索结果聚类算法.首先,建立特征词同现网络,利用概念分组技术挖掘特征词之间的语义关联,形成主题概念类;然后,计算文档与各概念类之间的距离,据此实现Web搜索结果的聚类;最后,综合考虑特征词在类内和文档集中的重要性进行类别标签的选择.实验结果表明本算法具有较好的聚类性能,明显优于k-均值算法,且产生的类别标签容易理解.

关 键 词:信息检索    搜索引擎    Web文档    聚类    概念分组  
收稿时间:2008-5-19
修稿时间:2008-8-22

Clustering Algorithm of Web Search Results Based on Conceptual Grouping
Li Hong-mei,Ding Zhen-guo,Zhou Shui-sheng,Zhou Li-hua.Clustering Algorithm of Web Search Results Based on Conceptual Grouping[J].Journal of South China University of Technology(Natural Science Edition),2009,37(1):130-134.
Authors:Li Hong-mei  Ding Zhen-guo  Zhou Shui-sheng  Zhou Li-hua
Institution:(1.School of Computer Science and Technology, Xidian University, Xi'an 710071, Shaanxi, China;2.School of Science, Xidian University, Xi'an 710071, Shaanxi, China)
Abstract:In order to facilitate the browse of the search results obtained by search engines and to rapidly and effectively find valuable Web documents, this paper proposes a new clustering algorithm of Web search results based on the conceptual grouping. In this algorithm, first, the co occurrence networks of characteristic terms are built. Next, the semantic relationships among characteristic terms are mined via the conceptual grouping to form different clusters related to the query topic. Then, the distances between the Web documents and the formed clusters are calculated for the clustering of Web search results. Finally, the cluster labels are selected according to the importance of characteristic terms in the search results and the clusters. It is indicated by experiments that the proposed algorithm performs better than the k-means algorithm, and that the labels selected by the algorithm are apprehensible.
Keywords:information retrieval" target="_blank">information retrieval')">information retrieval  search engine  Web document  clustering  conceptual grouping" target="_blank">')">conceptual grouping
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《华南理工大学学报(自然科学版)》浏览原始摘要信息
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号