
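Research on Chinese Nested Named Entity Relation Extraction (中文嵌套命名实体关系抽取研究)
Cite this article: XU Haoliang, LI Yanqun, HE Yunqi, QIAN Longhua. Research on Chinese Nested Named Entity Relation Extraction [J]. Acta Scientiarum Naturalium Universitatis Pekinensis (北京大学学报(自然科学版)), 2019, 55(1): 8-14. DOI: 10.13209/j.0479-8023.2018.056
Authors: XU Haoliang (许浩亮)  LI Yanqun (李雁群)  HE Yunqi (何云琪)  QIAN Longhua (钱龙华)
Affiliation: School of Computer Science and Technology, Soochow University, Suzhou 215006 (all four authors)
Funding: Supported by the National Natural Science Foundation of China (2017YFB1002101)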
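Abstract (translated from the Chinese): To address the lack of a corpus for research on nested named entity relation extraction, this work builds on an existing Chinese named entity corpus and combines manual annotation with machine learning to extract the semantic relations between nested entities. A Chinese nested named entity relation corpus is first annotated by hand, and relation extraction experiments on Chinese nested entities are then run with support vector machines, convolutional neural networks, and other methods. The results show that, on the corpus with manually annotated entities, nested entity relation extraction performs very well, with an F1 score above 95%, whereas performance on automatically recognized entities is still unsatisfactory.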

Keywords: nested entity relation extraction  information extraction  support vector machines  convolutional neural networks
Received: 2018-04-15

Research on Chinese Nested Named Entity Relation Extraction
XU Haoliang,LI Yanqun,HE Yunqi,QIAN Longhua. Research on Chinese Nested Named Entity Relation Extraction[J]. Acta Scientiarum Naturalium Universitatis Pekinensis, 2019, 55(1): 8-14. DOI: 10.13209/j.0479-8023.2018.056
Authors: XU Haoliang  LI Yanqun  HE Yunqi  QIAN Longhua
Affiliation: School of Computer Science & Technology, Soochow University, Suzhou 215006
Abstract: Research on relation extraction between nested named entities lacks benchmark corpora. To address this problem, manual annotation is combined with machine learning to extract semantic relations from an existing Chinese named entity recognition corpus. The authors manually annotate a Chinese nested named entity relation corpus built on that existing corpus and conduct relation extraction experiments between nested named entities using support vector machine (SVM) and convolutional neural network (CNN) models. The experimental results show that nested entity relation extraction performs very well on the corpus with manually labeled entities, reaching an F1 score of over 95%, while it falls short of expectations on automatically recognized entities.
Keywords: nested entity relation extraction  information extraction  support vector machines  convolutional neural networks
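
The abstract reports results as an F1 score. As a rough illustration of how that metric is commonly computed for relation extraction, the sketch below compares a set of gold relations with a set of predicted ones. The triple representation (outer entity, inner entity, relation label), the label names, and the toy data are assumptions made for this example; they are not taken from the paper's corpus or evaluation script.

```python
from typing import Set, Tuple

# Hypothetical representation of one nested-entity relation:
# (outer entity, inner entity, relation label). Not the paper's format.
Relation = Tuple[str, str, str]


def precision_recall_f1(
    gold: Set[Relation], predicted: Set[Relation]
) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) using exact match on relation triples."""
    true_positives = len(gold & predicted)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return 0.0, 0.0, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Toy data, invented purely for illustration.
gold = {
    ("Soochow University School of Computer Science", "Soochow University", "part-of"),
    ("Beijing Municipal Government", "Beijing", "located-in"),
    ("Peking University Press", "Peking University", "part-of"),
    ("Bank of China Suzhou Branch", "Suzhou", "located-in"),
}
predicted = {
    ("Soochow University School of Computer Science", "Soochow University", "part-of"),
    ("Beijing Municipal Government", "Beijing", "located-in"),
    ("Peking University Press", "Peking University", "part-of"),
    ("Bank of China Suzhou Branch", "Bank of China", "located-in"),  # wrong inner entity
}

p, r, f1 = precision_recall_f1(gold, predicted)
print(f"precision={p:.2f} recall={r:.2f} f1={f1:.2f}")
```

With automatically recognized entities, a predicted triple can only match if both entity spans were recognized correctly, which is one plausible reason scores drop in that setting.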
This article is indexed in CNKI, Wanfang Data (万方数据), and other databases.