首页 | 本学科首页   官方微博 | 高级检索  
     

基于语义提升HMM的语义标注
引用本文:李向阳,张亚非,陆建江. 基于语义提升HMM的语义标注[J]. 解放军理工大学学报(自然科学版), 2005, 6(1): 30-35
作者姓名:李向阳  张亚非  陆建江
作者单位:解放军理工大学,通信工程学院,江苏,南京,210007;解放军理工大学,训练部,江苏,南京,210007;东南大学,计算机科学与工程系,江苏,南京,210096
基金项目:国家自然科学基金资助项目 (60 3 0 3 0 2 4)
摘    要:语义标注所用标签数目众多,训练数据更为稀疏,用HMM作语义标注面临参数估计不准的难题。不同于传统的解决数据稀疏方法,以《同义词词林》的层次式结构为依据,提出了利用语义层次的提升来改善HMM(hidden Markov model)中参数的估计质量;在算法实现中,采用选择受限策略来解决因语义提升而引起的模型辨别力下降问题。测试表明,在训练数据相对稀疏的情况下,适度调整模型的语义层次可大幅提高语义标注的精度,该方法表现出较好的可塑性。

关 键 词:语义标注  隐马尔可夫模型  层次结构  自然语言处理
文章编号:1009-3443(2005)01-0030-06
修稿时间:2004-05-19

Semantic tagging using HMM with semantic induction
LI Xiang-yang,ZHANG Ya-fei and LU Jian-jiang. Semantic tagging using HMM with semantic induction[J]. Journal of PLA University of Science and Technology(Natural Science Edition), 2005, 6(1): 30-35
Authors:LI Xiang-yang  ZHANG Ya-fei  LU Jian-jiang
Affiliation:LI Xiang-yang~1,ZHANG Ya-fei~2,LU Jian-jiang~3
Abstract:It is difficult for hidden Markov models to get precise parameter estimation when applied to semantic tagging as the number of semantic tags is large and training data is insufficient. Different from classic method for solving data sparsity problem, a method which takes advantage of the hierarchical structure of Synonymy Thesaurus was presented to improve the quality of HMM parameter estimation by semantic induction. Restrictive selection policy was used to reverse the decline of model discriminability caused by the method. Tests indicate that the method is feasible to semantic tagging and tuning according to training data with size can greatly improve semantic tagging.
Keywords:semantic tagging  HMM(hidden Markov model)  hierarchy structure  natural language processing
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《解放军理工大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《解放军理工大学学报(自然科学版)》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号