首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于清华汉语树库的复句关系词识别与分类研究
引用本文:李艳翠,孙静,周国栋,冯文贺.基于清华汉语树库的复句关系词识别与分类研究[J].北京大学学报(自然科学版),2014,50(1):118.
作者姓名:李艳翠  孙静  周国栋  冯文贺
作者单位:1. 苏州大学计算机科学与技术学院, 苏州 215006; 2. 河南科技学院信息工程学院, 新乡 453003; 3. 河南科技学院人文学院, 新乡 453003;
基金项目:863计划(2012AA011102);国家自然科学基金(61273320);教育部人文社会科学研究青年基金(13YJC740022)资助
摘    要:根据清华汉语树库的标注方法, 利用规则从中提取复句关系词并标注其类别, 然后分别抽取带功能标记和不带功能标记的自动句法树的句法、词法、位置特征, 进行复句关系词的识别和分类。实验结果表明, 复句关系词判断准确率达95.7%, 复句关系词类别判断F1值为77.2%。

关 键 词:复句关系词  清华汉语树库  关系词识别  关系词分类  
收稿时间:2013-06-15

Recognition and Classification of Relation Words in the Compound Sentences Based on Tsinghua Chinese Treebank
LI Yancui,SUN Jing,ZHOU Guodong,FENG Wenhe.Recognition and Classification of Relation Words in the Compound Sentences Based on Tsinghua Chinese Treebank[J].Acta Scientiarum Naturalium Universitatis Pekinensis,2014,50(1):118.
Authors:LI Yancui  SUN Jing  ZHOU Guodong  FENG Wenhe
Institution:1. Department of Computer Science and Technology, Soochow University, Suzhou 215006; 2. School of Information Engineering, Henan Institute of Science and Technology, Xinxiang 453003; 3. School of humanities, Henan Institute of Science and Technology, Xinxiang 453003;
Abstract:According to Tsinghua Chinese Treebank annotation methods, the authors extracted relation words and marked their categories. Then syntax, lexical and position features of automatic syntax tree with and without functional marker were extracted to recognize and classify relation words. Experiment results show that relative recognition accuracy is 95.7%, and relation words classification F1 is 77.2%.
Keywords:relation words in compound sentences  Tsinghua Chinese Treebank  relation words recognition  relation words classification  
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《北京大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《北京大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号