首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于少量示例的个性化Web信息自动获取系统
引用本文:张春元,康耀红,雷景生.基于少量示例的个性化Web信息自动获取系统[J].郑州大学学报(理学版),2006,38(4):44-49.
作者姓名:张春元  康耀红  雷景生
作者单位:海南大学信息科学技术学院,海口,570228
摘    要:基于关键词的搜索引擎满足了人们一定的需要,但由于其通用的性质,并不能满足用户的个性化需求,为此,设计并实现了一个基于示例的个性化Web信息自动获取系统.该系统采用了一种新的基于少量Web示例网页和语料库词频统计的特征抽取算法和过滤阈值设定方法.实验结果表明,较基于关键词的搜索引擎而言,该系统能充分考虑用户的兴趣偏好(示例),长期、主动地向用户提供更加准确的Web信息获取服务.

关 键 词:个性化Web信息获取  Web信息过滤  特征抽取  少量Web文档示例
文章编号:1671-6841(2006)04-0044-06
收稿时间:05 1 2006 12:00AM
修稿时间:2006年5月1日

A Personalized Web Information Auto-retrieval System Based on Small Samples
ZHANG Chun-yuan,KANG Yao-hong,LEI Jing-sheng.A Personalized Web Information Auto-retrieval System Based on Small Samples[J].Journal of Zhengzhou University:Natural Science Edition,2006,38(4):44-49.
Authors:ZHANG Chun-yuan  KANG Yao-hong  LEI Jing-sheng
Institution:Institute of Information Science and Technology, Hainan University, Haikou 570228, China
Abstract:Although current search engines based on keywords satisfy some users' need,they can't meet users' personalized demands for their all-purpose characteristics.The design and implementation of a novel personalized Web information auto-retrieval system based on small samples is presented.This system adopts a new algorithm of feature extraction and a new method to determine filtering threshold based on small webpage training sets and term-frequency statistics of corpus.Experimental results show that this system can long-termly and on its own initiative provide more accurate Web information-obtaining service to a user according to his interest than the search engines based on keywords.
Keywords:personalized Web information retrieval  Web document filtering  feature extraction  small samples of Web documents
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号