Disambiguating Authors by Pairwise Classification
Authors: Quan Lin, Bo Wang, Yuan Du, Xuezhi Wang, Yuhua Li, Songcan Chen
Institution: a Department of Computer Science, Huazhong University of Science and Technology, Wuhan 430074, China; b Department of Computer Science, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China; c Department of Computer Science, Tsinghua University, Beijing 100084, China
Abstract: Name ambiguity is a critical problem in many applications, particularly in online bibliography systems such as DBLP, ACM, and CiteSeerX. Despite many studies, the problem remains unresolved and is becoming more serious with the increasing popularity of Web 2.0. This paper addresses the problem in the academic researcher social network ArnetMiner using a supervised method that exploits all available side information, including co-authors, organization, paper citations, title similarity, the author's homepage, web constraints, and user feedback. The method automatically determines the number of distinct persons k. Tests on the researcher social network with up to 100 different names show that the method significantly outperforms a baseline that uses an unsupervised attribute-augmented graph clustering algorithm.
Keywords: disambiguating; pairwise classification; ArnetMiner
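
The general idea of pairwise classification named in the title can be sketched as follows. This is a minimal illustration and not the authors' implementation: the three features, the hand-set `WEIGHTS` and `THRESHOLD` (standing in for a trained classifier), the helper names, and the toy records are all assumptions. Each pair of papers bearing the same author name is scored as "same person" or not, and positive pairs are merged with union-find, so the number of persons k falls out as the number of resulting groups rather than being fixed in advance.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard overlap of two collections; 0.0 when both are empty."""
    a, b = set(a), set(b)
    return len(a & b) / len(a | b) if (a | b) else 0.0


def pair_features(p, q):
    """Hypothetical pairwise features: co-author overlap, same organization, title overlap."""
    return [
        jaccard(p["coauthors"], q["coauthors"]),
        1.0 if p["org"] == q["org"] else 0.0,
        jaccard(p["title"].lower().split(), q["title"].lower().split()),
    ]


# Assumed values standing in for a classifier trained on labeled pairs.
WEIGHTS = [2.0, 1.0, 1.0]
THRESHOLD = 1.0


def same_author(p, q):
    """Linear decision rule over the pairwise features."""
    score = sum(w * f for w, f in zip(WEIGHTS, pair_features(p, q)))
    return score >= THRESHOLD


def disambiguate(papers):
    """Merge papers predicted to share an author; returns groups of paper indices."""
    parent = list(range(len(papers)))

    def find(i):
        # Follow parent links to the root, compressing the path on the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(papers)), 2):
        if same_author(papers[i], papers[j]):
            parent[find(i)] = find(j)

    clusters = {}
    for i in range(len(papers)):
        clusters.setdefault(find(i), []).append(i)
    return sorted(clusters.values())


# Toy records, invented for illustration only.
papers = [
    {"title": "Graph clustering for name disambiguation",
     "coauthors": ["coauthor_x", "coauthor_y"], "org": "Org A"},
    {"title": "Name disambiguation with graph features",
     "coauthors": ["coauthor_x"], "org": "Org A"},
    {"title": "Protein folding simulation",
     "coauthors": ["coauthor_z"], "org": "Org B"},
]

groups = disambiguate(papers)
k = len(groups)  # number of distinct persons inferred from the merged pairs
```

In practice the decision rule would be learned from labeled pairs, and additional signals listed in the abstract (citations, homepage, web constraints, user feedback) would be added as further features.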
This article is indexed in CNKI, ScienceDirect, and other databases.