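Construction and Use of Semantic Trees for Web Pages

Cite this article:	Zhao Yanbin, Li Qinghua, Zhao Feng. Construction and use of semantic trees for Web pages[J]. Journal of Huazhong University of Science and Technology (Natural Science Edition), 2005, 33(Z1): 229-231.

Authors:	Zhao Yanbin, Li Qinghua, Zhao Feng

Affiliations:	School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074; National High Performance Computing Center (Wuhan), Huazhong University of Science and Technology, Wuhan, Hubei 430074

Funding:	Supported by the National Natural Science Foundation of China (60273075)

Abstract:	The DOM trees of ill-formed Web pages suffer from excessive tree depth, too many node levels, and incorrect relationships between node levels and subtrees. Based on an analysis of these problems, a fault-tolerant method for constructing semantic trees of Web pages is proposed. The method lays a foundation for research on text classification and clustering, Web community discovery, extraction of Web topic information, and topic-based Web information retrieval.
Keywords:	Web pages; DOM; fault tolerance; semantic tree; semantic tree pruning
Article ID:	1671-4512(2005)S1-0229-03
Revision date:	August 20, 2005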
The construct and use of semantic tree of Web pages

Zhao Yanbin, Li Qinghua, Zhao Feng. The construct and use of semantic tree of Web pages[J]. JOURNAL OF HUAZHONG UNIVERSITY OF SCIENCE AND TECHNOLOGY. NATURE SCIENCE, 2005, 33(Z1): 229-231.

Authors:	Zhao Yanbin, Li Qinghua, Zhao Feng

Institution:	Zhao Yanbin, Li Qinghua, Zhao Feng; Doctoral Candidate; College of Computer Sci. & Tech., Huazhong Univ. of Sci. & Tech., Wuhan 430074, China.

Abstract:	An analysis of ill-formed Web pages shows that their DOM trees are too deep, have too many node levels, and contain incorrect relationships between node levels and subtrees. A fault-tolerant technique for constructing semantic trees of Web pages is proposed. The resulting semantic tree can support research on text classification and clustering, network community discovery, Web topical information extraction, and Web information retrieval.

Keywords:	Web pages; DOM; fault tolerance; semantic tree; semantic tree pruning
This article is indexed in CNKI, Wanfang Data, and other databases.
