首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于分类和关键词组抽取的信息检索算法
引用本文:钟敏娟,林亚平,陈治平.基于分类和关键词组抽取的信息检索算法[J].系统仿真学报,2004,16(5):1009-1012,1016.
作者姓名:钟敏娟  林亚平  陈治平
作者单位:湖南大学计算机与通信学院,长沙,410082
基金项目:国家自然科学基金(60272051)
摘    要:本文提出一种基于分类和关键词组抽取的信息检索算法。该算法利用文本分类和信息抽取技术辅助检索,避免了向量空间模型算法中时间复杂度过大,查准率不高的缺点。针对传统的信息检索性能指标无法有效地衡量检索结果的排序状况,本文还引入了排序误差率概念用于评价检索结果的排序。实验结果表明,所提算法与TFIDF算法、基于分类的交互式检索算法相比,具有更快的查询速度,更高的查准率和更小的排序误差率。

关 键 词:向量空间模型  文本分类  关键词组抽取  查准率  排序误差率
文章编号:1004-731X(2004)05-1009-04

An Information-retrieval Algorithm Based on Classification and Key Phrase Extraction
ZHONG Min-juan,LIN Ya-ping,CHEN Zhi-ping.An Information-retrieval Algorithm Based on Classification and Key Phrase Extraction[J].Journal of System Simulation,2004,16(5):1009-1012,1016.
Authors:ZHONG Min-juan  LIN Ya-ping  CHEN Zhi-ping
Abstract:In this paper, a new information retrieval algorithm based on classification and key phrase extraction is proposed. Compared with traditional vector space model, this algorithm reduces time complexity and improves precision using of text classification and information extraction. Then a new performance criterion named ranking error is contributed to solve the problem that the traditional performance evaluation methodology cant evaluate the ranking results of retrieved documents efficiently. The experiment result shows that the proposed algorithm outperforms TF*IDF and Interactive Retrieval based on classification in speed, precision and ranking error.
Keywords:vector space model  text classification  key phrase extraction  precision  ranking error
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号