首页 | 本学科首页   官方微博 | 高级检索  
     


A lexicalized second-order-HMM for ambiguity resolution in Chinese segmentation and POS tagging
Authors:Chen Yin  Yang Muyun  Zhao Tiejun  Yu Hao  Li Sheng
Abstract:Hidden Markov Model(HMM) is a main solution to ambiguities in Chinese segmentation and POS (part-of-speech) tagging. While most previous works for HMM-based Chinese segmentation and POS tagging consult POS information in contexts, they do not utilize lexical information which is crucial for resolving certain morphological ambiguity. This paper proposes a method which incorporates lexical information and wider context information into HMM. Model induction and related smoothing technique are presented in detail. Experiments indicate that this technique improves the segmentation and tagging accuracy by nearly 1%.
Keywords:hidden Markov model   chinese segmentation   part-of-speech tagging
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号