A Research on Bilingual Dictionary Based Sentence Alignment for Chinese English Parallel Corpus |
| |
Authors: | Li Sheng Zhao Tiejun Lu Yajuan Liu Zhanyi |
| |
Affiliation: | Dept.of Computer Science and Engineering, Harbin Institute of Technology,Harbin 150001, P.R.China;Dept.of Computer Science and Engineering, Harbin Institute of Technology,Harbin 150001, P.R.China;Dept.of Computer Science and Engineering, Harbin Institute of Technology,Harbin 150001, P.R.China;Dept.of Computer Science and Engineering, Harbin Institute of Technology,Harbin 150001, P.R.China;Dept.of Computer Science and Engineering, Harbin Institute of Technology,Harbin 150001, P.R.China |
| |
Abstract: | Parallel corpus is of great importance to machine translation, and automatic sentence alignment is the first step towards its processing. This paper puts forward a bilingual dictionary based sentence alignment method for Chinese English parallel corpus, which differs from previous length based algorithm in its knowledge-rich approach. Experimental result shows that this method produces over 93% accuracy with usual English-Chinese dictionaries whose translations cover 31.88%~47.90% of the corpus. |
| |
Keywords: | Sentence alignment Parallel corpus Bilingual dictionary |
本文献已被 维普 万方数据 等数据库收录! |
|