首页 | 本学科首页   官方微博 | 高级检索  
     检索      

A New Generalized Similarity-Based Topic Distillation Algorithm
作者姓名:ZHOU  Hongfang  DANG  Xiaohui
作者单位:[1]School of Computer Science and Engineering, Xi'an University of Technology, Xi'an 710048, Shaanxi, China [2]Xi'an Branch, Shaanxi Telecom Company Limited, Xi'an 710000, Shaanxi, China
基金项目:Supported by the Shaanxi Provincial Educational Depar tment Special-Purpose Technology and Research of China (06JX229)
摘    要:The procedure of hypertext induced topic search based on a semantic relation model is analyzed, and the reason for the topic drift of HITS algorithm was found to prove that Web pages are projected to a wrong latent semantic basis. A new concept-generalized similarity is introduced and, based on this, a new topic distillation algorithm GSTDA(generalized similarity based topic distillation algorithm) was presented to improve the quality of topic distillation. GSTDA was applied not only to avoid the topic drift, but also to explore relative topics to user query. The experimental results on 10 queries show that GSTDA reduces topic drift rate by 10% to 58% compared to that of HITS(hypertext induced topic search) algorithm, and discovers several relative topics to queries that have multiple meanings.

关 键 词:广义相似性  超文本感应主题搜索  网络  计算机技术
文章编号:1007-1202(2007)05-0789-04
收稿时间:6 February 2007
修稿时间:2007-02-06

A new generalized similarity-based topic distillation algorithm
ZHOU Hongfang DANG Xiaohui.A New Generalized Similarity-Based Topic Distillation Algorithm[J].Wuhan University Journal of Natural Sciences,2007,12(5):789-792.
Authors:Zhou Hongfang  Dang Xiaohui
Institution:(1) School of Computer Science and Engineering, Xi’an University of Technology, Xi’an, 710048, Shaanxi, China;(2) Xi’an Branch, Shaanxi Telecom Company Limited, Xi’an, 710000, Shaanxi, China
Abstract:The procedure of hypertext induced topic search based on a semantic relation model is analyzed, and the reason for the topic drift of HITS algorithm was found to prove that Web pages are projected to a wrong latent semantic basis. A new concept-generalized similarity is introduced and, based on this, a new topic distillation algorithm GSTDA(generalized similarity based topic distillation algorithm) was presented to improve the quality of topic distillation. GSTDA was applied not only to avoid the topic drift, but also to explore relative topics to user query. The experimental results on 10 queries show that GSTDA reduces topic drift rate by 10% to 58% compared to that of HITS(hypertext induced topic search) algorithm, and discovers several relative topics to queries that have multiple meanings. Biography: ZHOU Hongfang(1976–), female, Lecturer, Ph.D., research direction: data mining and knowledge discovery in databases.
Keywords:generalized similarity  hypertext induced topic search  topic distillation  topic drift
本文献已被 维普 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号