首页 | 本学科首页   官方微博 | 高级检索  
     检索      

Web-Based Information Extraction Technology
作者姓名:孙铁利  教巍巍  刘淑华
作者单位:[1]School of Computer Science, Northeast Normal University, Changchun 130024, China [2]School of Computer Science and Engineering, Liaoning University of Technology, Jinzhou, Liaoning 121001, China
摘    要:

关 键 词:HTML  XML  信息提取  隐马尔可夫模型
文章编号:1672-5220(2007)02-0288-05
修稿时间:2006-08-20

Web-Based Information Extraction Technology
SUN Tie-li,JIAO Wei-wei,LIU Shu-hua.Web-Based Information Extraction Technology[J].Journal of Donghua University,2007,24(2):288-292.
Authors:SUN Tie-li  JIAO Wei-wei  LIU Shu-hua
Institution:1. School of Computer Science,Northeast Normal University, Changchun 130024, China
2. School of Computer Science and Engineering, Liaoning University of Technology, Jinzhou, Liaoning 121001, China
Abstract:Information extraction techniques on the Web are the current research hotspot. Now many information extraction techniques based on different principles have appeared and have different capabilities. We classify the existing information extraction techniques by the principle of information extraction and analyze the methods and principles of semantic information adding, schema defining,rule expression, semantic items locating and object locating in the approaches. Based on the above survey and analysis,several open problems are discussed.
Keywords:HTML  XML  rule  semantic  information extraction  Hidden Markov model
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号