首页 | 本学科首页   官方微博 | 高级检索  
     检索      

用有向图法解决网页爬行中循环链接问题
引用本文:赫枫龄,左万利.用有向图法解决网页爬行中循环链接问题[J].吉林大学学报(理学版),2004,42(3):402-404.
作者姓名:赫枫龄  左万利
作者单位:吉林大学计算机科学与技术学院, 长春 130012
基金项目:国家自然科学基金(批准号:60373099).
摘    要:提出网页构成的有向回路问题, 描述了由网页构成有向图的形式定义, 并给出了用有向图法发现网页构成的有向回路算法. 所给定的算法能使网页爬行器避免掉入由已爬行过的网页构成的有向回路陷阱.

关 键 词:爬行器  网络搜索引擎  超链接  有向图  
文章编号:1671-5489(2004)03-0402-03
收稿时间:2003-12-17
修稿时间:2003年12月17

Solve the cycle links problem in Internet crawling by directed graph
HE Feng-ling,ZUO Wan-Li.Solve the cycle links problem in Internet crawling by directed graph[J].Journal of Jilin University: Sci Ed,2004,42(3):402-404.
Authors:HE Feng-ling  ZUO Wan-Li
Institution:College of Computer Science and Technology, Jilin University, Chan gchun 130012, China
Abstract:The present paper deals with the technique how to solve the problem of cycle links in internet crawling by directed graph. First, the problem is proposed. Then, the formal definition of cycle links in internet crawling is described. Finally, the algorithm to solve the problem by directed graph is given. The key problem to a crawler is how to find directed loops effectively in web pages crawled by the crawler. The algorithm described in this paper can make the crawler avoid dropping in the pitfall created by cycle links.
Keywords:crawler  internet search engine  hyperlink  directed graph
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《吉林大学学报(理学版)》浏览原始摘要信息
点击此处可从《吉林大学学报(理学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号