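A Study of Part-of-Speech Rules for a Civil Aviation Regulations Termbase for Computer-Aided Translation
Cite this article: 王坤.面向计算机辅助翻译的民航规章术语库词性规则研究[J].中国科技术语,2022,24(2):65-69.
Author: WANG Kun (王坤)
Affiliation: School of Foreign Languages, Civil Aviation University of China, Tianjin 300300
Funding: Civil Aviation University of China, Fundamental Research Funds for the Central Universities project "Transparent Discourse Strategies in English-Chinese Translation" (3122018R010)
Abstract: Mainstream computer-aided translation (CAT) systems currently rely on translation memory (TM) and termbases (TB) to improve translation efficiency. Translation memory takes the natural sentence as its main matching unit and requires whole sentences to be similar or repeated, so matches are hard to obtain. A termbase, by contrast, matches at the level of word chunks, which is more flexible and can compensate for this weakness of translation memory. Building a termbase involves automatic term extraction, which requires reference to the part-of-speech (POS) rules of high-frequency chunks in the specific text type. This article uses n-grams to extract recurrent chunks from English civil aviation regulation texts, examines the POS combinations of high-frequency chunks at different chunk lengths and recurrence frequencies, and compares them with literary texts. The study finds that, in English civil aviation regulation texts, the recurrent chunks suitable for a CAT termbase are predominantly noun phrases, which differs significantly from literary texts.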

Keywords: computer-aided translation; termbase; n-gram; civil aviation regulations
Received: 2021-10-12
Revised: 2022-03-08

Analysis on POS Configuration for Civil Aviation Regulations Termbase based on CAT System
WANG Kun.Analysis on POS Configuration for Civil Aviation Regulations Termbase based on CAT System[J].Chinese Science and Technology Terms Journal,2022,24(2):65-69.
Authors:WANG Kun
Abstract: Most current CAT systems leverage Translation Memory (TM) and a Termbase (TB) to improve translation efficiency. Because TM depends on whole-sentence repetition or similarity, it is limited in practice and is often complemented by a termbase, which is more flexible in use. Building a termbase requires automatic term extraction, which in turn demands knowledge of the POS (part-of-speech) configuration of terms in the specific text type. Using corpus tools, we extracted n-grams of given lengths and frequencies from US civil aviation regulations and examined the POS configuration of those recurrent chunks, then contrasted it with that of literary texts. The study shows a dominance of NP and PP in the recurrent chunks suitable for a CAT termbase in those regulations, which differs from the result for literary texts.
Keywords: Computer-Aided Translation (CAT); termbase; n-gram; civil aviation regulations
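
The extraction step described in the abstract (collect n-grams of a given length, keep those that recur often enough, then look at their POS configuration) can be illustrated with a minimal Python sketch. This is not the author's tooling: the function names `recurrent_ngrams` and `pos_pattern_counts`, the `min_freq` threshold, the Penn Treebank-style tags, and the hand-tagged toy sentence are all assumptions made for illustration, and a real study would tag a full corpus with a POS tagger first.

```python
from collections import Counter


def recurrent_ngrams(tagged, n, min_freq=2):
    """Count n-grams over (word, tag) pairs and keep those seen at least min_freq times.

    Keys are tuples of (lowercased word, tag) pairs, so the same word sequence
    with different tags is counted as a different chunk.
    """
    counts = Counter(
        tuple((word.lower(), tag) for word, tag in tagged[i:i + n])
        for i in range(len(tagged) - n + 1)
    )
    return {gram: c for gram, c in counts.items() if c >= min_freq}


def pos_pattern_counts(ngrams):
    """Tally how many distinct recurrent n-grams share each POS pattern."""
    return Counter(tuple(tag for _, tag in gram) for gram in ngrams)


# Hypothetical hand-tagged toy input, not data from the study.
tagged = [
    ("The", "DT"), ("certificate", "NN"), ("holder", "NN"), ("must", "MD"),
    ("notify", "VB"), ("the", "DT"), ("certificate", "NN"), ("holder", "NN"),
    ("of", "IN"), ("the", "DT"), ("flight", "NN"), ("crew", "NN"),
    ("and", "CC"), ("the", "DT"), ("flight", "NN"), ("crew", "NN"),
]

for n in (2, 3):
    grams = recurrent_ngrams(tagged, n, min_freq=2)
    for pattern, count in sorted(pos_pattern_counts(grams).items()):
        print(n, " ".join(pattern), count)
```

On this toy input every recurrent chunk is nominal (for example "certificate holder" and "the flight crew"), which mirrors the kind of pattern the abstract reports but proves nothing about the actual corpus. Counting distinct chunk types per POS pattern, rather than total occurrences, is one possible design choice; the abstract does not say which the study used.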