首页 | 本学科首页   官方微博 | 高级检索  
     检索      

一种基于名词短语的检索结果多层聚类方法
引用本文:庞观松,张黎莎,蒋盛益,邝丽敏,吴美玲.一种基于名词短语的检索结果多层聚类方法[J].山东大学学报(理学版),2010,45(7):39-44.
作者姓名:庞观松  张黎莎  蒋盛益  邝丽敏  吴美玲
作者单位:广东外语外贸大学信息学院, 广东 广州 510420
基金项目:国家自然科学基金资助项目,广东省高等学校自然科学研究重点项目,广东省自然科学基金资助项目 
摘    要:为了对检索结果获取高质量的聚类效果,提取名词短语作为候选类别标签,根据候选类别标签分布情况生成基础类,再使用具有线性时间复杂度的一趟聚类算法对基础类进行多层聚类。与NEC,STC和Lingo算法的对比实验表明:该方法在类别标签的可读性、有效性以及聚类性能上都优于以上3种方法。

关 键 词:信息检索  检索结果聚类  文本聚类  多层聚类  
收稿时间:2010-04-02

A multi-level clustering approach based on noun phrases for search results
PANG Guan-song,ZHANG Li-sha,JIANG Sheng-yi,KUANG Li-min,WU Mei-ling.A multi-level clustering approach based on noun phrases for search results[J].Journal of Shandong University,2010,45(7):39-44.
Authors:PANG Guan-song  ZHANG Li-sha  JIANG Sheng-yi  KUANG Li-min  WU Mei-ling
Institution:School of Informatics, Guangdong University of Foreign Studies, Guangzhou 510420, Guangdong, China
Abstract:In order to  get high qualitative clustering results, the noun phrases was selected as candidate cluster labels and generates basic clusters based on the distribution of candidate cluster labels. And then multi-level clustering was proceeded on basic clusters by using one pass clustering algorithm with linear time complexity. The comparative experiment was carried with our method, NEC algorithm, STC algorithm and Lingo  algorithm, and the results showed that our method could get more informative, readable cluster labels and more effective than other three methods.
Keywords:information retrieval  search results clustering  text clustering  multi-level clustering
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《山东大学学报(理学版)》浏览原始摘要信息
点击此处可从《山东大学学报(理学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号