A lexicalized second-order-HMM for ambiguity resolution in Chinese segmentation and POS tagging |
| |
Authors: | Chen Yin Yang Muyun Zhao Tiejun Yu Hao Li Sheng |
| |
Abstract: | Hidden Markov Model(HMM) is a main solution to ambiguities in Chinese segmentation and POS (part-of-speech) tagging. While most previous works for HMM-based Chinese segmentation and POS tagging consult POS information in contexts, they do not utilize lexical information which is crucial for resolving certain morphological ambiguity. This paper proposes a method which incorporates lexical information and wider context information into HMM. Model induction and related smoothing technique are presented in detail. Experiments indicate that this technique improves the segmentation and tagging accuracy by nearly 1%. |
| |
Keywords: | hidden Markov model chinese segmentation part-of-speech tagging |
本文献已被 CNKI 维普 万方数据 等数据库收录! |