On the use of topological features and hierarchical characterization for disambiguating names in collaborative networks
Diego R. Amancio, Osvaldo N. Oliveira Jr., Luciano da F. Costa

TL;DR
This paper explores how hierarchical topological features and pattern recognition can improve the disambiguation of author names in collaborative networks, especially by considering longer-range connections and multiple data sources.
Contribution
It demonstrates that using third-level hierarchical connections and network topology enhances the ability to distinguish homonymous authors in collaborative networks.
Findings
Longer-distance connections improve disambiguation accuracy.
Pattern recognition strategies effectively utilize network topology.
Combining topology features with long-range connections further enhances results.
Abstract
Many features of complex systems can now be unveiled by applying statistical physics methods to treat them as social networks. The power of the analysis may be limited, however, by the presence of ambiguity in names, e.g., caused by homonymy in collaborative networks. In this paper we show that the ability to distinguish between homonymous authors is enhanced when longer-distance connections are considered, rather than looking at only the immediate neighbors of a node in the collaborative network. Optimized results were obtained upon using the 3rd hierarchy in connections. Furthermore, reasonable distinction among authors could also be achieved upon using pattern recognition strategies for the data generated from the topology of the collaborative network. These results were obtained with a network from papers in the arXiv repository, into which homonymy was deliberately introduced to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
