Resolving Author Name Homonymy to Improve Resolution of Structures in   Co-author Networks

Theresa Velden; Asif-ul Haque; Carl Lagoze

arXiv:1106.2473·cs.DL·June 14, 2011·1 cites

Resolving Author Name Homonymy to Improve Resolution of Structures in Co-author Networks

Theresa Velden, Asif-ul Haque, Carl Lagoze

PDF

Open Access

TL;DR

This paper presents a scalable algorithm to resolve author name homonymy, improving the accuracy of co-author network structures by distinguishing node roles and assessing network distortion without extensive ground truth data.

Contribution

The authors introduce a novel, effective method for disambiguating author names in large co-author networks, enhancing the resolution of mesoscopic structures.

Findings

01

Algorithm effectively reduces network distortion caused by homonymy

02

Node role distinction improves network analysis accuracy

03

Method does not require extensive ground truth sampling

Abstract

We investigate how author name homonymy distorts clustered large-scale co-author networks, and present a simple, effective, scalable and generalizable algorithm to ameliorate such distortions. We evaluate the performance of the algorithm to improve the resolution of mesoscopic network structures. To this end, we establish the ground truth for a sample of author names that is statistically representative of different types of nodes in the co-author network, distinguished by their role for the connectivity of the network. We finally observe that this distinction of node roles based on the mesoscopic structure of the network, in combination with a quantification of author name commonality, suggests a new approach to assess network distortion by homonymy and to analyze the reduction of distortion in the network after disambiguation, without requiring ground truth sampling.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Quality and Management · Advanced Graph Neural Networks · Biomedical Text Mining and Ontologies