首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于内容相关性挖掘的反馈式搜索引擎框架
引用本文:侯越先,张鹏,于瑞国.基于内容相关性挖掘的反馈式搜索引擎框架[J].天津大学学报(自然科学与工程技术版),2008,41(8):941-945.
作者姓名:侯越先  张鹏  于瑞国
作者单位:天津大学计算机科学与技术学院,天津300072
基金项目:国家自然科学基金,天津市应用基础研究项目,微软亚洲研究院专项基金,海量科技基金
摘    要:当前主流的搜索引擎根据查询词在网页中的出现频率,辅以网页权威性等信息,生成查询结果.但用户提供的查询词往往非常简单,因此搜索引擎难以确定用户的查询意图.为此,给出了一种利用海量clickthrough数据进行网页内容相关性挖掘的方法,在此基础上给出了一种反馈式搜索引擎(FSE)框架及相关算法.FSE根据网页相关性动态生成查询结果,以期提供给用户更中肯和个性化的信息.基于真实点击数据,进行了网页相关性矩阵的压缩实验和有效性实验,证明了该框架的可行性.

关 键 词:WEB信息检索  反馈式搜索引擎  网页相关性  clickthrough数据

Framework of Feedback Search Engine Based on Content Relevance Mining
HOU Yue-xian,ZHANG Peng,YU Rui-guo.Framework of Feedback Search Engine Based on Content Relevance Mining[J].Journal of Tianjin University(Science and Technology),2008,41(8):941-945.
Authors:HOU Yue-xian  ZHANG Peng  YU Rui-guo
Institution:(School of Computer Science and Technology, Tianjin University, Tianjin 300072, China)
Abstract:Current mainstream search engines generate search results by analyzing statistical information such as the frequency of queries in web pages and the ranking of web pages. But search engines cannot determine what kind of information users want because queries are often simple in many situations. A web content relevance mining method was put forward which uses large amounts of clickthrough data. Furthermore, based on this method, a framework of feedback search engine (FSE)and associated algorithms were proposed. According to page-to-page relevance, FSE generated search results dynamically to provide users with more accurate and personalized information. Based on real clickthrough data, experiments on the compressibility and effectiveness of the web relevance matrix were performed. And the experimental results demonstrate the feasibility of the proposed framework.
Keywords:web information retrieval  feedback search engine  page-to-page relevance  clickthrough data
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号