首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于二字词位图表的汉语自动分词词典机制
引用本文:蒋斌,杨超,赵欢.基于二字词位图表的汉语自动分词词典机制[J].湖南大学学报(自然科学版),2006,33(1):121-123.
作者姓名:蒋斌  杨超  赵欢
作者单位:湖南大学,计算机科学与通信学院,湖南,长沙,410082;湖南大学,计算机科学与通信学院,湖南,长沙,410082;湖南大学,计算机科学与通信学院,湖南,长沙,410082
基金项目:湖南省自然科学基金资助项目(03JJY3097)
摘    要:根据汉语中二字词较多的特点,提出了一种新的分词词典机制.该机制在词典数据结构中添加二字词检测位图表,在分词时,利用位图表可快速判断二字词优化分词速度.选取人民日报语料片断进行了实验测试.实验结果表明,基于二字词检测位图表的分词词典机制有效地提高了汉语自动分词的速度和效率.

关 键 词:汉语自动分词  分词词典机制  二字词检测位图表
文章编号:1000-2472(2006)01-0121-03
收稿时间:2005-04-27
修稿时间:2005-04-27

A Kind of Dictionary Mechanism Based on the Two-Word-Bitmap for Chinese Word Segmentation
JIANG Bin,YANG Chao,ZHAO Huan.A Kind of Dictionary Mechanism Based on the Two-Word-Bitmap for Chinese Word Segmentation[J].Journal of Hunan University(Naturnal Science),2006,33(1):121-123.
Authors:JIANG Bin  YANG Chao  ZHAO Huan
Institution:College of Computer and Communication, Hunan Univ, Changsha, Hunan 410082, China
Abstract:According to the characteristics that two-word words are abundant in Chinese,this paper put forward a new dictionary mechanism,which added two-word-bitmap into the data structure.The speed of word segmentation can be improved by using this two-word-bitmap to judge whether two single words could be a two-word or not.Furthermore,some experiments were done to test the algorithm.The results showed that the new dictionary mechanism based on the two-word-bitmap could improve speed and achieve more efficiency in Chinese word segmentation.
Keywords:Chinese word segmentation  dictionary mechanism  two-word-bitmap
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《湖南大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《湖南大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号