首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于链接聚类的Shark-Search算法
引用本文:苏祺,项锟,孙斌.基于链接聚类的Shark-Search算法[J].山东大学学报(理学版),2006,41(3):1-04.
作者姓名:苏祺  项锟  孙斌
作者单位:北京大学,计算语言学研究所,北京,100871
基金项目:国家自然科学基金,中国科学院资助项目
摘    要:根据对Shark-Search主题爬取算法的分析,提出了一种基于链接聚类的改进Shark-Search算法. 并通过几个对比实验对该算法进行了验证. 实验结果表明,新算法能够更有效地识别链接与主题的相关性.

关 键 词:Shark-Search算法  主题爬取  链接聚类
收稿时间:2006-03-09

The Shark-Search algorithm based on clustering links
SU Qi,XIANG Kun,SUN Bin.The Shark-Search algorithm based on clustering links[J].Journal of Shandong University,2006,41(3):1-04.
Authors:SU Qi  XIANG Kun  SUN Bin
Institution:Institute of Computational Linguistics, Peking Univ., Beijing 100871, China
Abstract:Based on the analysis of the focused-crawling algorithm Shark-Search, an improved Shark-Search algorithm with link clustering is proposed. The new algorithm by several comparable experiments is validated. The results show that it could identify the relevance between link and focused topic more effectively.
Keywords:Shark-Search algorithm  focused crawling  link clustering
本文献已被 万方数据 等数据库收录!
点击此处可从《山东大学学报(理学版)》浏览原始摘要信息
点击此处可从《山东大学学报(理学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号