首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于选择偏向性的统计机器翻译模型
引用本文:唐海庆,熊德意.基于选择偏向性的统计机器翻译模型[J].北京大学学报(自然科学版),2016,52(1):127-133.
作者姓名:唐海庆  熊德意
作者单位:苏州大学计算机科学与技术学院, 苏州 215006
基金项目:国家自然科学基金青年基金,江苏省自然科学基金青年基金
摘    要:针对基于短语的统计机器翻译使用有限的语义知识, 导致长距离的动宾短语对翻译质量不高的问题, 提出基于动词选择偏向性的翻译模型, 引入动词对宾语的语义约束信息, 为动词找到合适的宾语翻译。首先使用条件概率方法, 训练动词对宾语的选择偏向性, 然后将选择偏向性作为一个新特征, 集成到基于短语的翻译系统中。在大规模测试数据集上完成汉语到英语的翻译, 实验结果表明, 基于选择偏向性的翻译模型能够很好地捕获长距离的语义依赖关系, 从而提高译文质量。

关 键 词:语义知识  选择偏向性  语义约束  语义依赖  />  
收稿时间:2015-06-19

A Selectional Preference Based Translation Model for SMT
TANG Haiqing,XIONG Deyi.A Selectional Preference Based Translation Model for SMT[J].Acta Scientiarum Naturalium Universitatis Pekinensis,2016,52(1):127-133.
Authors:TANG Haiqing  XIONG Deyi
Institution:School of Computer Science and Technology, Soochow University, Suzhou 215006
Abstract:The limited semantic knowledge is used in the phrase-based statistical machine translation (SMT), which causes that the translation quality of long-distance verb and its object is low. A selectional preference based translation model is proposed, which inducts the semantic constraints that a verb imposes on its object to select the proper argument-head word for the predicate with long distance. The authors train the corpus to obtain the conditional probability based selectional preferences for verb, and integrate the selectional preferences into a phrase-based translation system and evaluate on a Chinese-to-English translation task with large-scale training data. Experiment results show that the integration of selectional preference into SMT can effectively capture the long-distance semantic dependencies and improve the translation quality.
Keywords:semantic knowledge  selectional preference  semantic constraints  semantic dependencies
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《北京大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《北京大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号