首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于移动代理的MAISE爬虫的设计与实现
引用本文:石柯,周利兵,陶文兵.基于移动代理的MAISE爬虫的设计与实现[J].华中科技大学学报(自然科学版),2005,33(Z1):226-228.
作者姓名:石柯  周利兵  陶文兵
作者单位:华中科技大学,计算机科学与技术学院,湖北,武汉,430074
基金项目:中国教育科研网格计划ChinaGrid资助项目(CG2003-GA001)
摘    要:提出了一种基于移动代理的图像搜索引擎(MAISE,Mobile Agent based Image Search Engine)的爬虫系统,系统中爬虫代理运行在远程Web服务器上,它将集中在服务器端的任务如:特征提取、建立索引等分散到远程的Web服务器上并行运行,而且代理个数是可控的,最后将少量的数据回传到服务器端,这不仅提高了效率而且减小了网络传输量.最后对MAISE爬虫系统进行了测试,实验结果表明,MAISE爬虫的网络数据传输量和爬行时间等指标上均优于传统爬虫.

关 键 词:搜索引擎  网络爬虫  移动代理  并行化搜索
文章编号:1671-4512(2005)S1-0226-03
修稿时间:2005年8月19日

Design and implementation of MAISE CRAWLER based on mobile agent
Shi Ke,Zhou Libing,Tao Wenbing.Design and implementation of MAISE CRAWLER based on mobile agent[J].JOURNAL OF HUAZHONG UNIVERSITY OF SCIENCE AND TECHNOLOGY.NATURE SCIENCE,2005,33(Z1):226-228.
Authors:Shi Ke  Zhou Libing  Tao Wenbing
Institution:Shi Ke Zhou Libing Tao Wenbing Assoc.Prof.,College of Computer Sci.& Tech.,Huazhong Univ.of Sci.& Tech.,Wuhan 430074,China.
Abstract:A mobile agent based crawler system for Image Search Engine is proposed in this paper to address this issue.In our system,the crawlers are implemented as mobile agents that can run on the remote servers,which lead to most computing-intensive tasks,i.e.feature extracting,indexing,can be parallelized and carried out on different remote web servers.Cooperatively executing computing-intensive tasks on different servers by multiple crawlers makes a great improvement in processing speed.Moreover,only necessary processing results need to be transferred among different servers which decrease network traffic obviously.A prototype system is built and performance test demonstrates it outperforms traditional crawler systems.
Keywords:image search engine  Web crawler  mobile agent  parallel search
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号