首页 | 本学科首页   官方微博 | 高级检索  
     检索      

利用支持向量回归确定相关Web查询
引用本文:王继民,彭波,孟涛.利用支持向量回归确定相关Web查询[J].华南理工大学学报(自然科学版),2006,34(6):74-78,94.
作者姓名:王继民  彭波  孟涛
作者单位:1. 北京大学,信息管理系,北京,100871
2. 北京大学,信息科学技术学院,北京,100871
基金项目:国家自然科学基金;国家自然科学基金
摘    要:对用户输入的查询请求,如果搜索引擎系统能给出一个相关查询列表,将有助于用户进行查询修正,进而检索到用户所需要的信息.文中提出了一种利用支持向量回归确定相关Web查询的新方法.对一个给定的Web查询,首先从用户的使用记录中抽取候选查询的5个量化指标:被查询的次数、被查询的用户量、用户在反馈结果中的点击次数、与给定查询间的共有词项个数和点击相同网址(URL)的个数;然后用手工标记部分训练数据,进而建立支持向量回归模型,根据相关度的大小确定相关Web查询.实验结果表明该方法具有较高的准确度.

关 键 词:搜索引擎  用户日志  相关Web查询  支持向量回归
文章编号:1000-565X(2006)06-0074-05
收稿时间:2005-07-15
修稿时间:2005-07-15

Determination of Related Web Queries Using Support Vector Regression
Wang Ji-min,Peng Bo,Meng Tao.Determination of Related Web Queries Using Support Vector Regression[J].Journal of South China University of Technology(Natural Science Edition),2006,34(6):74-78,94.
Authors:Wang Ji-min  Peng Bo  Meng Tao
Institution:1. Dept. of Information Management, Peking Univ. , Beijing 100871, China; 2. School of Electronics Engineering and Computer Science, Peking Univ. , Beijing 100871, China
Abstract:When a user submits a Web query to a search engine,it is helpful for the user to modify the query and find the needed information if the system returns a list of related Web queries.This paper presents a new determination method of related Web queries using support vector regression.In this method,five quantified indexes of a candidate query are extracted from the log files,including the submitted number of the candidate query,the total numbers of submitting the candidate query and hitting the returned result,the number of common terms and the number of hitting common URL(Uniform Resource Locator) between the candidate query and the given query.The obtained candidate queries are then ranked based on support vector regression models learned from parts of human-labeled training data.The related Web queries are finally determined according to the relevance.Experimental results show that the proposed method is of high prediction precision.
Keywords:search engine  user log  related Web query  support vector regression
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号