On the use of topological features and hierarchical characterization for   disambiguating names in collaborative networks

Diego R. Amancio; Osvaldo N. Oliveira Jr.; Luciano da F. Costa

arXiv:1302.4504·physics.soc-ph·February 20, 2013

On the use of topological features and hierarchical characterization for disambiguating names in collaborative networks

Diego R. Amancio, Osvaldo N. Oliveira Jr., Luciano da F. Costa

PDF

TL;DR

This paper explores how hierarchical topological features and pattern recognition can improve the disambiguation of author names in collaborative networks, especially by considering longer-range connections and multiple data sources.

Contribution

It demonstrates that using third-level hierarchical connections and network topology enhances the ability to distinguish homonymous authors in collaborative networks.

Findings

01

Longer-distance connections improve disambiguation accuracy.

02

Pattern recognition strategies effectively utilize network topology.

03

Combining topology features with long-range connections further enhances results.

Abstract

Many features of complex systems can now be unveiled by applying statistical physics methods to treat them as social networks. The power of the analysis may be limited, however, by the presence of ambiguity in names, e.g., caused by homonymy in collaborative networks. In this paper we show that the ability to distinguish between homonymous authors is enhanced when longer-distance connections are considered, rather than looking at only the immediate neighbors of a node in the collaborative network. Optimized results were obtained upon using the 3rd hierarchy in connections. Furthermore, reasonable distinction among authors could also be achieved upon using pattern recognition strategies for the data generated from the topology of the collaborative network. These results were obtained with a network from papers in the arXiv repository, into which homonymy was deliberately introduced to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.