Statistical Approach Based on POS for Chinese Time Word Disambiguity



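Cite this article:	DAI Jian-ying, HE Zhong-shi. Statistical Approach Based on POS for Chinese Time Word Disambiguity[J]. Journal of Chongqing University (Natural Science Edition), 2005, 28(9): 53-56.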

Authors:	DAI Jian-ying (代建英), HE Zhong-shi (何中市)

Affiliation:	College of Computer Science, Chongqing University, Chongqing 400030 (both authors)

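Abstract:	Segmentation ambiguity is an important factor affecting the accuracy of Chinese automatic word segmentation systems. Time expressions comprise time-point words, which indicate the definite temporal position at which an event occurs, and time-duration words, which indicate that an action or state lasts for a period of time. Under the modern Chinese corpus processing specification, certain types of time expressions exhibit segmentation ambiguity. Based on this observation and on an examination of how time expressions are used, the paper proposes a statistical language model built on the part-of-speech information of the context surrounding a time expression, together with an algorithm based on the maximum likelihood principle for resolving this kind of ambiguity. Its accuracy in open testing is about 90%.
Keywords:	natural language processing; segmentation ambiguity; time expressions; part-of-speech information; statistical language model
Article ID:	1000-582X(2005)09-0053-04
Received:	2005-02-24
Revised:	2005-02-24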
Statistical Approach Based on POS for Chinese Time Word Disambiguity

DAI Jian-ying, HE Zhong-shi. Statistical Approach Based on POS for Chinese Time Word Disambiguity[J]. Journal of Chongqing University (Natural Science Edition), 2005, 28(9): 53-56.

Authors:	DAI Jian-ying, HE Zhong-shi

Abstract:	Segmentation ambiguity is an important factor influencing the accuracy of Chinese automatic segmentation systems. Time words include expressions that indicate an exact position in time and expressions that span a period of time. On the basis of modern Chinese corpus processing principles and the segmentation ambiguity of certain types of time words, this paper proposes a statistical language model that uses the part-of-speech (POS) information of the context, and a corresponding approach based on maximum likelihood, to resolve the ambiguity. The approach reaches about 90% accuracy in open testing, which shows the effectiveness of the algorithm.

Keywords:	natural language processing; segmentation ambiguity; time word; part of speech (POS); statistical language model
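
The abstract names the ingredients of the method (a statistical language model over the POS tags around a time word, and a maximum-likelihood decision) but gives no formulas. The following is only a rough sketch of that general idea, not the authors' model: it assumes a POS bigram model estimated by relative frequency with add-one smoothing, and picks the candidate segmentation whose tag sequence is most likely. The tag names (`t` time word, `m` numeral, `q` measure word, `v` verb, `n` noun), the toy corpus, and all function names are hypothetical.

```python
import math
from collections import Counter

# Hypothetical toy corpus: each sentence is a list of POS tags only.
TOY_CORPUS = [
    ["n", "t", "v", "n"],       # time-point word before a verb
    ["t", "v", "n"],
    ["n", "v", "m", "q"],       # numeral + measure word after a verb
    ["n", "v", "m", "q", "n"],
    ["t", "n", "v", "n"],
]

def train_bigram_counts(tagged_sentences):
    """Count POS history tags and POS bigrams, with sentence boundary markers."""
    unigrams, bigrams = Counter(), Counter()
    for tags in tagged_sentences:
        padded = ["<s>"] + list(tags) + ["</s>"]
        for prev, cur in zip(padded, padded[1:]):
            unigrams[prev] += 1
            bigrams[(prev, cur)] += 1
    return unigrams, bigrams

def log_likelihood(tags, unigrams, bigrams, vocab_size, alpha=1.0):
    """Log-probability of a tag sequence under the smoothed bigram model.

    Add-alpha smoothing is an assumption made here so that unseen
    bigrams do not produce log(0).
    """
    padded = ["<s>"] + list(tags) + ["</s>"]
    total = 0.0
    for prev, cur in zip(padded, padded[1:]):
        p = (bigrams[(prev, cur)] + alpha) / (unigrams[prev] + alpha * vocab_size)
        total += math.log(p)
    return total

def disambiguate(candidates, unigrams, bigrams, vocab_size):
    """Return the name of the candidate tag sequence with the highest likelihood."""
    return max(
        candidates,
        key=lambda name: log_likelihood(candidates[name], unigrams, bigrams, vocab_size),
    )

UNIGRAMS, BIGRAMS = train_bigram_counts(TOY_CORPUS)
# Number of distinct tags (including "</s>") that can follow a history tag.
VOCAB_SIZE = len({cur for _, cur in BIGRAMS})

# An ambiguous string in the context "noun [X] verb noun": either one
# time-point word (t) or a numeral plus a measure word (m q).
before_verb = {"point": ["n", "t", "v", "n"], "duration": ["n", "m", "q", "v", "n"]}
# The same ambiguity in the context "noun verb [X]".
after_verb = {"point": ["n", "v", "t"], "duration": ["n", "v", "m", "q"]}

print(disambiguate(before_verb, UNIGRAMS, BIGRAMS, VOCAB_SIZE))
print(disambiguate(after_verb, UNIGRAMS, BIGRAMS, VOCAB_SIZE))
```

Because the two candidates yield tag sequences of different lengths, a real system would likely score only a fixed window around the ambiguous string; the sketch compares whole sequences for brevity.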
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.