Network-based stochastic competitive learning approach to disambiguation in collaborative networks
Thiago C. Silva, Diego R. Amancio

TL;DR
This paper introduces a network-based stochastic competitive learning method using particle competition for name disambiguation in collaborative networks, improving clustering accuracy by capturing topological features.
Contribution
It proposes a novel unsupervised particle competition model that combines random and preferential walks for effective name disambiguation in complex networks.
Findings
Model outperforms traditional clustering methods in accuracy.
Effective in networks from arXiv and other databases.
Captures topological features to improve disambiguation.
Abstract
Many patterns have been uncovered in complex systems through the application of concepts and methodologies of complex networks. Unfortunately, the validity and accuracy of the unveiled patterns are strongly dependent on the amount of unavoidable noise pervading the data, such as the presence of homonymous individuals in social networks. In the current paper, we investigate the problem of name disambiguation in collaborative networks, a task that plays a fundamental role on a myriad of scientific contexts. In special, we use an unsupervised technique which relies on a particle competition mechanism in a networked environment to detect the clusters. It has been shown that, in this kind of environment, the learning process can be improved because the network representation of data can capture topological features of the input data set. Specifically, in the proposed disambiguating model, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
