Automatic Data Extraction from Websites for Generating Aquatic Product Market Information
Authors: YUAN Hong-chun, CHEN Ying, SUN Yue-fu
Affiliations: [1] College of Information Technology, Shanghai Fisheries University, Shanghai 200090; [2] School of Information Systems, University of Tasmania, Australia
Funding: Supported by the Shanghai Education Committee (No. 06KZ016)
Abstract: The massive volume of web-based information resources has led to increasing demand for effective automatic retrieval of target information for web applications. This paper introduces a web-based data extraction tool that applies several algorithms to locate, extract, and filter tabular data from HTML pages and to transform the data into new web-based representations. The tool has been applied in an aquaculture web application platform to extract and generate aquatic product market information. The results indicate that the tool is effective at extracting the required data from web pages.

Keywords (Chinese record): websites; aquatic products; data analysis; data collection; web pages
Received: 2006-08-20

Citation: YUAN Hong-chun, CHEN Ying, SUN Yue-fu. Automatic Data Extraction from Websites for Generating Aquatic Product Market Information [J]. Journal of Donghua University, 2006, 23(6): 15-19.
Keywords: web data; table localization algorithm; distance algorithm; data filtering algorithm; data extraction tool
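
The abstract describes a pipeline that locates, extracts, and filters tabular data from HTML pages, but this record does not give the algorithms themselves. The following is only a minimal sketch of that general idea using Python's standard `html.parser` module; it is not the authors' tool. The class `TableExtractor`, the functions `locate_table` and `filter_rows`, the header-matching heuristic, and the sample page are all hypothetical, and the "distance algorithm" named in the keywords is not illustrated because the record does not define it.

```python
from html.parser import HTMLParser


class TableExtractor(HTMLParser):
    """Collect every <table> in a page as a list of rows of cell strings.

    Assumes well-formed, non-nested tables with explicit closing tags.
    """

    def __init__(self):
        super().__init__()
        self.tables = []     # finished tables
        self._table = None   # rows of the table being read
        self._row = None     # cells of the row being read
        self._cell = None    # text fragments of the cell being read

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self._table = []
        elif tag == "tr" and self._table is not None:
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            # Collapse runs of whitespace inside the cell text.
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self._table.append(self._row)
            self._row, self._cell = None, None
        elif tag == "table" and self._table is not None:
            self.tables.append(self._table)
            self._table, self._row, self._cell = None, None, None


def locate_table(tables, required_headers):
    """Hypothetical localization rule: first table whose header row
    contains every required column name (case-insensitive)."""
    wanted = {h.lower() for h in required_headers}
    for table in tables:
        if table and wanted <= {cell.lower() for cell in table[0]}:
            return table
    return None


def filter_rows(table, price_header):
    """Hypothetical filtering rule: keep rows that match the header width
    and whose price cell parses as a number."""
    header, *rows = table
    idx = [cell.lower() for cell in header].index(price_header.lower())
    kept = []
    for row in rows:
        if len(row) != len(header):
            continue
        try:
            float(row[idx])
        except ValueError:
            continue
        kept.append(dict(zip(header, row)))
    return kept


# Made-up page: a layout table followed by a market price table.
SAMPLE_PAGE = """
<html><body>
<table><tr><td>Home</td><td>About</td></tr></table>
<table>
  <tr><th>Product</th><th>Market</th><th>Price</th></tr>
  <tr><td>Grass carp</td><td>Shanghai</td><td>9.5</td></tr>
  <tr><td>Crucian carp</td><td>Shanghai</td><td>n/a</td></tr>
  <tr><td>Silver carp</td><td>Nanjing</td><td>6.0</td></tr>
</table>
</body></html>
"""

parser = TableExtractor()
parser.feed(SAMPLE_PAGE)
market_table = locate_table(parser.tables, ["Product", "Price"])
records = filter_rows(market_table, "Price")
print(records)
```

In this sketch the layout table is skipped because its first row lacks the required column names, and the row with a non-numeric price is dropped. Real pages often nest tables or omit closing tags, which this parser does not handle.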
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.