首页 | 本学科首页   官方微博 | 高级检索  
     检索      

大规模单语语料的索引及检索
引用本文:康伟.大规模单语语料的索引及检索[J].鞍山科技大学学报,2007,30(1):40-43.
作者姓名:康伟
作者单位:鞍山师范学院高等职业技术学院,辽宁鞍山114011
摘    要:针对大规模单语语料资源,提出了采用B-tree结构的二级索引机制;研究了索引及检索关键字的组织策略,引入了检索关键字的词频因素,通过关键字的分组及短语的识别策略,有效地解决了检索效率和准确率问题.

关 键 词:语料库  词频  二级索引  检索
文章编号:1672-4410(2007)01-0040-04
收稿时间:2006-12-30
修稿时间:2006-12-30

Indexing and retrieval of large scale monolingual corpus
KANG Wei.Indexing and retrieval of large scale monolingual corpus[J].Journal of Anshan University of Science and Technology,2007,30(1):40-43.
Authors:KANG Wei
Institution:High Vocational Technology Education School, Anshan Normal University, Anshan 114011, China
Abstract:In view of large-scale single language materials resources, proposed uses B-tree thc structure two level of index mechanism;This paper has studied the index and the retrieval key words organiT.ation strategy, has introduced the retrieval word frequency factor to key words, has solved the retrieval efficiency prob- lem effectively, simultaneously, enable the retrieval through the key words grouping and the phrase recognition strategy the rate of accuracy to have the large scale enhancement.
Keywords:corpus  word frequency  two level of index  retrieval
本文献已被 CNKI 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号