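Research on Focused-Strategy Crawler Technology for Uyghur Text
Cite this article: Aynur·ABDUWAYIT. Research on Focused-Strategy Crawler Technology for Uyghur Text[J]. Journal of Xinjiang Normal University (Natural Sciences Edition), 2014, 0(4): 75-78
Author: Aynur·ABDUWAYIT
Affiliation: Information Management Center, Xinjiang Normal University, Urumqi, Xinjiang 830054, China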
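Abstract: As online resources continue to grow, the Internet has replaced traditional channels as the way people obtain information. Uyghur has developed rapidly in areas such as language information processing and Web applications. This paper describes the working principle of web crawlers and focused crawling strategies. On this basis, and drawing on related research on Uyghur information extraction, it studies the structure and strategy of web crawling for Uyghur text, so as to provide a large-scale corpus for building the web page database of a Uyghur search engine and for research on Uyghur online public opinion analysis.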

Keywords: web crawler; Uyghur focused strategy; Uyghur search engine

Study on Focused Crawler for Uyghur Language
Aynur·ABDUWAYIT. Study on Focused Crawler for Uyghur Language[J]. Journal of Xinjiang Normal University (Natural Sciences Edition), 2014, 0(4): 75-78
Authors: Aynur·ABDUWAYIT
Affiliation: Aynur·ABDUWAYIT (Information Management Center, Xinjiang Normal University, Urumqi, Xinjiang, 830054, China)
Abstract: With the steady growth of online resources, the Internet has gradually replaced other channels through which people obtain information. Uyghur has developed rapidly in many research fields, including natural language processing and Web applications. This paper presents the basic theory of web crawlers and the strategy of focused crawlers. Building on studies of Uyghur information extraction, it then discusses the structure and strategy of a Uyghur web crawler. The aim is to provide a large-scale corpus for Uyghur search engines and for Uyghur online public opinion analysis.
Keywords: Web crawler; Uyghur Web crawler; Uyghur search engine
This article is indexed in VIP (维普) and other databases.