数据库受限汉语自然语言查询的分词研究与实现 The research and implementation of word segmentation of database natural language query based on restricted Chinese期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

数据库受限汉语自然语言查询的分词研究与实现

引用本文：	胡婕,李跃新.数据库受限汉语自然语言查询的分词研究与实现[J].湖北大学学报(自然科学版),2005,27(4):331-335.

作者姓名：	胡婕李跃新

作者单位：	湖北大学,数学与计算机科学学院,湖北,武汉,430062;湖北大学,数学与计算机科学学院,湖北,武汉,430062

摘要：	对数据库受限汉语自然语言查询语句进行分渊处理．分词算法分为两个部分，第一部分对最大匹配法进行改进，改进的核心思想足体现整句长词优先的原则，改进后的算法能够减少切分歧义；第二部分根据实例数据库的查询需要处理姓名和不稳定的属性值两类未登录词，未登录词的识别对后续句子的理解起着至关重要的作用．
关键词：	受限汉语自然语言分词算法最大匹配法长词优先未登录词
文章编号：	1000-2375(2005)04-0331-05
收稿时间：	10 21 2004 12:00AM
修稿时间：	2004年10月21
The research and implementation of word segmentation of database natural language query based on restricted Chinese

HU Jie, LI Yue-xin.The research and implementation of word segmentation of database natural language query based on restricted Chinese[J].Journal of Hubei University(Natural Science Edition),2005,27(4):331-335.

Authors:	HU Jie LI Yue-xin

Institution:	School of Mathematics and Computer Science, Hubei University, Wuhan 430062, China

Abstract:	This paper describes the word segmentation of database natural language query based on restricted Chinese. The word segmentation algorithm is made up of two parts. The first part improves the maximum matching segmentation algorithm that fully embodies the principle of priority of long word on a whole sentence. The improved algorithm can decrease the ambiguity of segmentation. The second part processes two sort of unlisted words that are name and unstable property value according to the requirement of instance database. The recognition of unlisted words plays an important role in the following understanding to sentence.

Keywords:	natural language based on restricted Chinese word segmentation algorithm maximum matching(MM) segmentation algorithm priority of long word unlisted word
本文献已被 CNKI 维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏