首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于COAE2016数据集的中文实体关系抽取算法研究
引用本文:孙建东,顾秀森,李彦,徐蔚然.基于COAE2016数据集的中文实体关系抽取算法研究[J].山东大学学报(理学版),2017,52(9):7-12.
作者姓名:孙建东  顾秀森  李彦  徐蔚然
作者单位:北京邮电大学模式识别与智能系统实验室, 北京 100876
基金项目:111计划资助项目(B08004);国家自然科学基金资助项目(61300080,61273217,61671078);国家教育部博士点基金资助项目(20130005110004)
摘    要:实体关系抽取是知识图谱技术的重要环节之一。英文实体关系抽取的研究已经比较成熟,相比之下,中文实体关系抽取的发展却并不理想。由于相关语料的匮乏,中文实体关系抽取的发展受到了一定的限制。针对这一问题,COAE2016在任务三中提出了中文实体关系抽取任务。通过分别使用了基于模板、基于SVM与基于CNN的实体关系抽取算法解决了这一问题,并根据其在COAE2016任务三的评测数据集上的效果,对比分析了三种实体关系抽取算法的优缺点。实验证明,基于SVM的算法和基于CNN的算法均在评测数据集上表现出了良好的效果。

关 键 词:关系抽取  模板匹配  SVM  CNN  
收稿时间:2016-11-25

Chinese entity relation extraction algorithms based on COAE2016 datasets
SUN Jian-dong,GU Xiu-sen,LI Yan,XU Wei-ran.Chinese entity relation extraction algorithms based on COAE2016 datasets[J].Journal of Shandong University,2017,52(9):7-12.
Authors:SUN Jian-dong  GU Xiu-sen  LI Yan  XU Wei-ran
Institution:Beijing University of Posts and Telecommunications, Lab of Pattern Recognition and Intelligent System, Beijing 100876, China
Abstract:Entity relation extraction is one of the important procedures of knowledge graph technology. Research on entity relation extraction in English is comparatively developed. By contrast, the development of Chinese entity relation extraction is not ideal, and it is mainly because the lack of corpus. In order to solve this problem, COAE2016 proposes a Chinese entity relation extraction task in task 3. In this paper, we use three algorithms to solve the problem: a pattern based algorithm, a SVM based algorithm and a CNN based algorithm respectively. Then, we analyze the advantages and the disadvantages of the three algorithms according to the effects of the dataset in COAE2016 Experiments show that the SVM based algorithm and the CNN based algorithm are useful to extract entity relation.
Keywords:feature extraction  SVM  CNN  pattern match  
本文献已被 CNKI 等数据库收录!
点击此处可从《山东大学学报(理学版)》浏览原始摘要信息
点击此处可从《山东大学学报(理学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号