

Chinese word segmentation with local and global context representation learning
Authors: Li Yan, Zhang Yinghua, Huang Xiaoping, Yin Xucheng, Hao Hongwei
Institution: 1. School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, P.R. China
2. Institute of Automation, Chinese Academy of Sciences, Beijing 100190, P.R. China
Abstract: This paper designs a local and global context representation learning model for Chinese characters and proposes a Chinese word segmentation method based on the learned character representations. First, the character representation model uses the semantics of both the local context and the global context to learn a representation for each Chinese character. Then, a Chinese word segmentation model is built as a neural network and trained with the character representations as its input features. Finally, experimental results show that the learned character representations effectively capture semantic information: characters with similar semantics cluster together in the visualized space. Moreover, the proposed Chinese word segmentation model also improves precision, recall, and F-measure.
Keywords: local and global context; representation learning; Chinese character representation; Chinese word segmentation
This article is indexed in CNKI, Wanfang Data, and other databases.
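
The abstract reports precision, recall, and F-measure for segmentation but does not say how they are computed. The following is a minimal sketch, assuming the common convention of counting a predicted word as correct only when its character span exactly matches a gold word. The function names `to_spans` and `prf` and the example sentence are hypothetical illustrations, not taken from the paper.

```python
def to_spans(words):
    """Convert a list of words into a set of (start, end) character offsets."""
    spans = set()
    start = 0
    for word in words:
        end = start + len(word)
        spans.add((start, end))
        start = end
    return spans


def prf(gold_words, pred_words):
    """Word-level precision, recall, and F-measure for one sentence.

    Assumption: a predicted word is correct only if both of its
    boundaries match a word in the gold segmentation.
    """
    gold = to_spans(gold_words)
    pred = to_spans(pred_words)
    correct = len(gold & pred)
    precision = correct / len(pred) if pred else 0.0
    recall = correct / len(gold) if gold else 0.0
    total = precision + recall
    f_measure = 2 * precision * recall / total if total else 0.0
    return precision, recall, f_measure


# Hypothetical example: the prediction wrongly splits the second word.
gold = ["我们", "喜欢", "北京"]
pred = ["我们", "喜", "欢", "北京"]
p, r, f = prf(gold, pred)
print(f"P={p:.3f} R={r:.3f} F={f:.3f}")
```

In this example two of the four predicted words match gold spans, so precision is 2/4 and recall is 2/3. Corpus-level scores are usually obtained by summing the correct, predicted, and gold counts over all sentences before dividing, rather than averaging per-sentence values.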