Topic-Specific Web Information Collection Technology

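Cite this article:	DU Huan. Topic-Specific Web Information Collection Technology[J]. Journal of Sichuan University of Science & Engineering (Natural Science Edition), 2007, 20(5): 10-13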

Author:	DU Huan

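Affiliation:	College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, China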

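Abstract:	With the rapid growth of the Internet, search engines have gradually become the main tool users rely on to find information on the Web. A traditional general-purpose search engine uses a Crawler program to collect information from the entire Web; its drawbacks are untargeted collection, a high rate of dead pages, and an inability to meet the needs of specific professional groups. What is needed instead is a topic-oriented search engine with fine-grained, accurate classification, comprehensive and in-depth data, and timely updates.
Keywords:	search engine; Web Crawler; topic-specific search engine
Article ID:	1673-1549(2007)05-0010-04
Received:	2007-05-15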
Topic-Specific Web Information Collection Technology

DU Huan. Topic-Specific Web Information Collection Technology[J]. Journal of Sichuan University of Science & Engineering (Natural Science Edition), 2007, 20(5): 10-13

Authors:	DU Huan

Affiliation:	College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, China

Abstract:	Search engines have become people's main means of gathering information on the Web. A traditional general-purpose search engine uses a program called a Crawler to collect information from the whole Web; its disadvantages include untargeted collection, a high rate of dead pages, and an inability to meet the needs of specific professional groups. What is needed is a focused search engine that is well classified, holds comprehensive and in-depth data, and is updated promptly.

Keywords:	Web Crawler
This article is indexed in databases including VIP and Wanfang Data.