基于标记树的WEB页面净化技术研究 Web Page Distillation Based on the Tag Tree期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

基于标记树的WEB页面净化技术研究

引用本文：	李明,张为群.基于标记树的WEB页面净化技术研究[J].西南师范大学学报(自然科学版),2006,31(5):128-131.

作者姓名：	李明张为群

作者单位：	1. 重庆教育学院,信息中心,重庆,400067 2. 西南大学,计算机与信息科学学院,重庆,400715

摘要：	根据Web页面标记建立标记树,通过分析,保留有用信息的标记子树,达到获取页面主要内容,净化页面的效果.
关键词：	标记树标记树模式页面净化
文章编号：	1000-5471（2006）05-0128-04
收稿时间：	2006-06-14
修稿时间：	2006年6月14日
Web Page Distillation Based on the Tag Tree

LI Ming,ZHANG Wei-qun.Web Page Distillation Based on the Tag Tree[J].Journal of Southwest China Normal University(Natural Science),2006,31(5):128-131.

Authors:	LI Ming ZHANG Wei-qun

Institution:	1. Information Center of Chongqing Educational College, Chongqing 400067; 2. School of Computer and Information Science, Southwest University, Chongqing 400715

Abstract:	It's the key problem that how to get the information people need of the internet through the computer. An arithmetic is put forward to solve this problem. At first a tag tree of the web page is constructed, then the authors divide the web page into several parts as Main part, Site flag, Navigation bar, Communication part, Copyrights, and the tag tree tells the relationship of these parts. The authors can parse the tag tree, get the child tag tree that only tells the Main part. So the main part is obtained and the web page is distilled.

Keywords:	tag tree tag tree model web page distillation
本文献已被 CNKI 维普万方数据等数据库收录！
	点击此处可从《西南师范大学学报(自然科学版)》浏览原始摘要信息
	点击此处可从《西南师范大学学报(自然科学版)》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏