A New Word Detection Method for Chinese Based on Local Context Information
Authors:ZENG Hua-lin  ZHOU Chang-le  ZHENG Xu-ling
Institution:Department of Cognitive Science, Fujian Key Laboratory of the Brain-like Intelligent Systems, Xiamen University, Xiamen 361005, China
Abstract:Detecting out-of-vocabulary words is an urgent and difficult task in Chinese word segmentation. To avoid the defects caused by offline training in traditional methods, this paper proposes an improved prediction by partial matching (PPM) segmentation algorithm for Chinese, based on extracting local context information: the context information of the text under test is added to the local PPM statistical model to guide the detection of new words. The algorithm focuses on online segmentation and new word detection, achieves good results in both closed and open tests, and outperforms some well-known Chinese segmentation systems to a certain extent.
Keywords:new word detection  improved PPM model  context information  Chinese word segmentation
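
The abstract does not give the details of the improved PPM model, so the following is only a minimal sketch of the general idea it describes: a character-level PPM-style model whose counts are updated from the text under test. The class name `LocalPPM`, the escape estimate (PPM method C, without exclusion), the maximum order of 2, and the alphabet size are all assumptions made for illustration, not the authors' implementation.

```python
from collections import defaultdict


class LocalPPM:
    """Illustrative adaptive character model with PPM-style back-off (hypothetical)."""

    def __init__(self, max_order=2, alphabet_size=65536):
        self.max_order = max_order          # longest context length used (assumed)
        self.alphabet_size = alphabet_size  # size of the uniform fallback alphabet (assumed)
        # counts[context][symbol] -> how often symbol followed context
        self.counts = defaultdict(lambda: defaultdict(int))

    def update(self, text):
        """Add the local context of the text under test to the model."""
        for i, ch in enumerate(text):
            for k in range(self.max_order + 1):
                if i - k < 0:
                    break
                self.counts[text[i - k:i]][ch] += 1

    def prob(self, ch, history):
        """Estimate P(ch | history), escaping to shorter contexts when ch is unseen."""
        p_escape = 1.0
        for k in range(min(self.max_order, len(history)), -1, -1):
            ctx = history[len(history) - k:]
            table = self.counts.get(ctx)
            if not table:
                continue  # context never seen: fall through to a shorter one
            total = sum(table.values())
            distinct = len(table)
            if ch in table:
                return p_escape * table[ch] / (total + distinct)
            # Method C escape: mass proportional to the number of distinct symbols
            p_escape *= distinct / (total + distinct)
        # No context predicts ch: fall back to a uniform distribution
        return p_escape / self.alphabet_size


model = LocalPPM()
before = model.prob("信", "微")
model.update("微信支付微信红包")
after = model.prob("信", "微")
print(before < after)
```

In this sketch, a character pair that recurs in the text under test becomes much more predictable once the local counts are added, which is the kind of signal a segmenter could use to keep the characters together as a candidate new word.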
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.