首页 | 本学科首页   官方微博 | 高级检索  
     检索      

Two-stage approach to full Chinese parsing
作者姓名:曹海龙  Zhao  Tiejun  Yang  Muyun  Li  Sheng
作者单位:MOE-MS Key Laboratory of Natural Language Processing and Speech, Harbin Institute of Technology, Harbin 150001, P.R. China
基金项目:Supported by the High Technology Research and Development Programme of China (No. 2004AA117010-09) and National Natural Science Foundation of China (No. 60302021 ).
摘    要:0IntroductionInthe past decades,great progress has been madeinthe field of Chinese word segmentation,part of speechtagging and partial parsing.Nowfull Chinese parsing,thenext step essential to Chinese understanding,is attractingmore and more attentionintheinternational researchcom-munity1-3].It is a challenging problemfor two reasons.First,like any natural language,there is much ambiguityin Chinese sentences resultingin huge searchspace.Sec-ond,Chinese has many different linguistic phenome…

关 键 词:自然语言处理系统  马尔可夫模型  模式识别  句法分析  汉语
收稿时间:2004-06-21

Two-stage approach to full Chinese parsing
Cao Hailong,Zhao Tiejun,Yang Muyun,Li Sheng.Two-stage approach to full Chinese parsing[J].High Technology Letters,2005,11(4):359-363.
Authors:Cao Hailong  Zhao Tiejun  Yang Muyun  Li Sheng
Abstract:Natural language parsing is a task of great importance and extreme difficulty. In this paper, we present a full Chinese parsing system based on a two-stage approach. Rather than identifying all phrases by a uniform model, we utilize a divide and conquer strategy. We propose an effective and fast method based on Markov model to identify the base phrases. Then we make the first attempt to extend one of the best English parsing models i.e. the head-driven model to recognize Chinese complex phrases. Our two-stage approach is superior to the uniform approach in two aspects. First, it creates synergy between the Markov model and the head-driven model. Second, it reduces the complexity of full Chinese parsing and makes the parsing system space and time efficient. We evaluate our approach in PARSEVAL measures on the open test set, the parsing system performances at 87.53% precision, 87.95% recall.
Keywords:natural language processing systems  parsing  markov model  pattern recognition
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号