首页 | 本学科首页   官方微博 | 高级检索  
     

基于语境关联的Web信息过滤算法
引用本文:席萌,郭巧. 基于语境关联的Web信息过滤算法[J]. 华中科技大学学报(自然科学版), 2003, 0(Z1)
作者姓名:席萌  郭巧
作者单位:北京理工大学网络信息中心
摘    要:设计一个文本过滤实验 ,首先从语料库的词频统计结果中挖掘出词频的二元关联度 ,然后用一个Hop field网络将词频的二元关联关系转化为语境关联关系 ,训练语言单位在整个上下文环境下的权重 ,并建立用户模板 .该算法改善了词频特征提取算法与文本上下文环境的匹配状况 ,实验结果表明 ,对专业性Web文档的过滤可达到更高的精确度

关 键 词:信息过滤  语境  词频  关联度  Hopfield神经网络

Web information filtering arithmetic based on context relevancy
Xi Meng Guo Qiao Postgraduate, Network Information Center,Beijing Institute of Technology,Beijing ,China.. Web information filtering arithmetic based on context relevancy[J]. JOURNAL OF HUAZHONG UNIVERSITY OF SCIENCE AND TECHNOLOGY.NATURE SCIENCE, 2003, 0(Z1)
Authors:Xi Meng Guo Qiao Postgraduate   Network Information Center  Beijing Institute of Technology  Beijing   China.
Affiliation:Xi Meng Guo Qiao Postgraduate, Network Information Center,Beijing Institute of Technology,Beijing 100081,China.
Abstract:Web Information Filtering, which is to filter the most expected documents from the target Web Set. As known from our reading experience, the context information contained in user-interested documents is an indispensability fact in estimating text subjects. To design a text-filter experiment, first, the dualistic relevancy of word coexistence frequency should be mined by the word frequency Stat. from corpus, then a Hopfield NN can be used to convert the dualistic relevancy to context relevancy and to train the weights of language units in the whole context, at last the user-template is built. This arithmetic can improve the matching condition between word frequency characters and the context information. The result of experiment shows that more precision can be achieved for professional Web text filtering.
Keywords:information filtering  context  word frequency  relevancy  Hopfield NN
本文献已被 CNKI 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号