
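Ontology Information Extraction Based on Wikipedia Information Box
Cite this article: CHEN Gang, XU Xingyu. Ontology Information Extraction Based on Wikipedia Information Box[J]. Journal of Jilin University (Science Edition), 2020, 58(2): 355-363.
Authors: CHEN Gang, XU Xingyu
Affiliation: School of Cyber Science and Engineering, Wuhan University, Wuhan 430079
Funding: Hubei Province Digital Publishing Special Fund
Abstract: To address the low precision of traditional methods for extracting ontology information from Wikipedia information boxes, this paper studies the structured attribute information in Wikipedia information boxes. First, a set of candidate features is defined to determine the relationships between information box attributes, and associations are established with categories, lists, articles, and Wikipedia information box templates. Then, drawing on ontology matching methods, structured information is extracted from the information boxes: the similarity of attribute pairs is computed, boundary constraints are set, and, once a given precision is reached, an ontology structure describing the relationships between attributes is built together with a class hierarchy. The results show that the proposed method solves the low-precision problem, efficiently and correctly extracts the likely attribute structures from articles on a given topic, and discovers reasonable class relationships.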

Keywords: Wikipedia; information box; ontology; class hierarchy
Received: 2018-11-02

Ontology Information Extraction Based on Wikipedia Information Box
CHEN Gang,XU Xingyu.Ontology Information Extraction Based on Wikipedia Information Box[J].Journal of Jilin University: Sci Ed,2020,58(2):355-363.
Authors:CHEN Gang  XU Xingyu
Institution:School of Cyber Science and Engineering, Wuhan University, Wuhan 430079, China
Abstract: To address the low precision of traditional methods for extracting ontology information from Wikipedia information boxes, we studied the structured attribute information in Wikipedia information boxes. First, a set of candidate features was defined to determine the relationships between information box attributes, and associations with categories, lists, articles, and Wikipedia information box templates were established. Second, drawing on ontology matching methods, we extracted the structured information of Wikipedia information boxes, calculated the similarity of attribute pairs, set boundary constraints, and, at a given level of precision, constructed an ontology structure describing the relationships between attributes together with a class hierarchy. The results show that the proposed method solves the problem of low precision in extracting ontology information, extracts the possible attribute structures in articles on a given topic efficiently and correctly, and finds reasonable class relationships.
Keywords: Wikipedia; information box; ontology; class hierarchy
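
The abstract names two steps: scoring the similarity of attribute pairs against a threshold, and deriving a class hierarchy. It does not state which features, similarity measure, or threshold the authors use, so the following is only a minimal sketch under stated assumptions, not the paper's method. It assumes Jaccard overlap of attribute value sets as the similarity, and it proposes a subclass link when one template's attribute set strictly contains another's. All function names, the threshold of 0.5, and the toy data are hypothetical.

```python
from itertools import combinations


def jaccard(a: set, b: set) -> float:
    """Jaccard overlap of two sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def similar_attribute_pairs(values_by_attr: dict, threshold: float) -> list:
    """Return (attr1, attr2, score) for pairs whose value sets overlap enough.

    Assumption: similarity is Jaccard overlap of observed values; the paper's
    actual candidate features are not given in the abstract.
    """
    pairs = []
    for a, b in combinations(sorted(values_by_attr), 2):
        score = jaccard(values_by_attr[a], values_by_attr[b])
        if score >= threshold:
            pairs.append((a, b, score))
    return pairs


def subclass_candidates(attrs_by_template: dict) -> list:
    """Propose (child, parent) when the child's attributes strictly contain the parent's.

    Assumption: attribute-set inclusion stands in for the paper's class
    hierarchy construction, which the abstract does not detail.
    """
    out = []
    for child, child_attrs in attrs_by_template.items():
        for parent, parent_attrs in attrs_by_template.items():
            if child != parent and parent_attrs < child_attrs:
                out.append((child, parent))
    return sorted(out)


# Toy, made-up data for illustration only.
values_by_attr = {
    "birth_place": {"Paris", "Wuhan", "Berlin"},
    "place_of_birth": {"Paris", "Wuhan", "Rome"},
    "occupation": {"actor", "writer"},
}
attrs_by_template = {
    "person": {"name", "birth_date"},
    "actor": {"name", "birth_date", "years_active"},
    "city": {"name", "population"},
}

pairs = similar_attribute_pairs(values_by_attr, threshold=0.5)
hierarchy = subclass_candidates(attrs_by_template)
print(pairs)      # [('birth_place', 'place_of_birth', 0.5)]
print(hierarchy)  # [('actor', 'person')]
```

In this toy run, `birth_place` and `place_of_birth` share two of four distinct values and so meet the assumed threshold, while `actor` is proposed as a subclass of `person` because it carries every `person` attribute plus one more. A stricter threshold would trade recall for precision, which is the kind of boundary constraint the abstract alludes to.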
This article is indexed in CNKI, Wanfang Data, and other databases.