
An Attribute Similarity Adjusting Algorithm Based on Functional Dependency
Cite this article: TAN Mingchao, DIAO Xingchun, CAO Jianjun, FENG Jing. An Attribute Similarity Adjusting Algorithm Based on Functional Dependency[J]. Journal of Shanghai Jiaotong University, 2015, 49(8): 1075-1083
Authors: TAN Mingchao (a), DIAO Xingchun (a), CAO Jianjun (a), FENG Jing (b)
Affiliation: PLA University of Science and Technology: (a) College of Command Information Systems, Nanjing 210007, China; (b) College of Meteorology and Oceanography, Nanjing 211101, China
Funding: Supported by the National Natural Science Foundation of China (61070714) and the Pre-research Foundation of PLA University of Science and Technology (20110604)
Abstract: The accuracy of attribute similarity is one of the important factors affecting the accuracy of entity resolution (ER). To improve the accuracy of attribute similarity, the relation between attribute similarity and functional dependency (FD) is analyzed, and principles for adjusting attribute similarity are given. FD-based methods are proposed for similarity partitioning, transitive similarity adjustment, and computing the cost of a similarity adjustment. Building on these methods, an algorithm for attribute similarity adjusting with FD (SAWFD) is put forward to improve the accuracy of attribute similarity. Experimental results show that the algorithm better distinguishes matching record pairs from non-matching record pairs and achieves higher recall, precision, and F1 scores.

Keywords: entity resolution; attribute similarity; functional dependency
Received: 2014-10-27
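
The abstract reports results in terms of recall, precision, and F1 over record pairs. As a reminder of how these measures are usually computed for entity resolution, here is a minimal Python sketch. It assumes that the predicted and true matches are each given as collections of unordered record-ID pairs. The function name `pair_metrics` and the input layout are illustrative assumptions, not taken from the paper, and the sketch does not implement SAWFD itself.

```python
def pair_metrics(predicted_pairs, true_pairs):
    """Return (precision, recall, f1) for predicted vs. true matching record pairs.

    Each pair is treated as unordered, so (1, 2) and (2, 1) count as the same pair.
    """
    predicted = {frozenset(pair) for pair in predicted_pairs}
    actual = {frozenset(pair) for pair in true_pairs}

    # True positives: pairs that are both predicted and actually matching.
    true_positives = len(predicted & actual)

    # Guard against empty inputs to avoid division by zero.
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(actual) if actual else 0.0

    # F1 is the harmonic mean of precision and recall.
    if precision + recall == 0:
        f1 = 0.0
    else:
        f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


if __name__ == "__main__":
    # Hypothetical record IDs, for illustration only.
    predicted = [(1, 2), (3, 4), (5, 6)]
    truth = [(2, 1), (3, 4), (7, 8), (9, 10)]
    print(pair_metrics(predicted, truth))
```

In this toy input, two of the three predicted pairs are true matches and two of the four true matches are found, so precision is 2/3 and recall is 1/2.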