
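Research on an Improved Chinese Word Segmentation Algorithm Based on Maximum Matching
Cite this article: Zhao Yuan. Research on an Improved Chinese Word Segmentation Algorithm Based on Maximum Matching[J]. 科技信息, 2010, 0(35): 58-58,49
Author: Zhao Yuan
Affiliation: Communications Department, Jilin Provincial Corps of the Chinese People's Armed Police Force, Changchun, Jilin 130021
Abstract: Building on existing Chinese word segmentation techniques, this paper proposes a segmentation method based on topic extraction from Chinese text. A topic dictionary is constructed following the idea of a concept semantic network and describes the conceptual-semantic relations between words. The text is segmented with an improved maximum matching algorithm, which improves segmentation accuracy, recognizes out-of-vocabulary words in the text, and normalizes topic terms in the same pass. User needs can thus be understood at the concept level, which enables concept retrieval and improves precision.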

Keywords: Chinese word segmentation  concept retrieval  word frequency statistics

Research of Improved Chinese Word Segmentation Algorithm Based on MM Algorithm
ZHAO Yuan. Research of Improved Chinese Word Segmentation Algorithm Based on MM Algorithm[J]. Science, 2010, 0(35): 58-58,49
Authors:ZHAO Yuan
Affiliation:Communications Department, Jilin Corps of CAPF, Changchun, Jilin 130021, China
Abstract:This paper puts forward a word segmentation method based on topic extraction from text. It provides a Chinese search engine model based on concept retrieval, so that the user's requirements can be understood at the conceptual level, concept search can be carried out, and precision can be improved. It uses an improved maximum matching (MM) segmentation algorithm, constructs a concept semantic network as its dictionary, and standardizes thematic words.
Keywords:Chinese Word Segmentation  Concept Retrieval  Frequency Statistics
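
The abstract does not spell out how the maximum matching algorithm is improved, so the following is only a minimal sketch of the plain forward maximum matching baseline, followed by a dictionary lookup that maps each token to a preferred topic term. The function names, the `max_len` parameter, and the toy `LEXICON` are assumptions made for illustration; they are not taken from the paper, and the sketch does not attempt the paper's out-of-vocabulary word recognition.

```python
from typing import Dict, List


def forward_max_match(text: str, lexicon: Dict[str, str], max_len: int = 4) -> List[str]:
    """Segment `text` with plain forward maximum matching.

    `lexicon` maps each known word to its preferred topic term (hypothetical
    layout). Characters not covered by any entry are emitted one at a time.
    """
    tokens: List[str] = []
    i = 0
    while i < len(text):
        # Try the longest candidate first and shrink until the lexicon matches.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            # A single character is always accepted as a fallback.
            if size == 1 or candidate in lexicon:
                tokens.append(candidate)
                i += size
                break
    return tokens


def normalize(tokens: List[str], lexicon: Dict[str, str]) -> List[str]:
    """Replace each token with its preferred topic term when one is listed."""
    return [lexicon.get(token, token) for token in tokens]


# Hypothetical toy lexicon: surface word -> preferred topic term.
LEXICON = {
    "中文": "中文",
    "分词": "分词",
    "切词": "分词",  # treated here as a variant of the preferred term "分词"
    "算法": "算法",
    "研究": "研究",
}

if __name__ == "__main__":
    tokens = forward_max_match("中文切词算法研究", LEXICON)
    print(tokens)                      # ['中文', '切词', '算法', '研究']
    print(normalize(tokens, LEXICON))  # ['中文', '分词', '算法', '研究']
```

Keeping the preferred term as the dictionary value lets segmentation and term normalization share one lookup table, which is one simple way to approximate the "same pass" normalization the abstract mentions.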
This article is indexed in 维普 (VIP) and other databases.