Author Name Disambiguation via Heterogeneous Network Embedding from Structural and Semantic Perspectives
Wenjin Xie, Siyuan Liu, Xiaomeng Wang, Tao Jia

TL;DR
This paper introduces a novel heterogeneous network embedding approach combining structural and semantic information, enhanced by self-attention, to improve author name disambiguation accuracy in academic digital libraries.
Contribution
The paper proposes a new method that integrates structural and semantic representations with self-attention for more accurate author disambiguation, addressing feature selection challenges.
Findings
Outperforms baseline methods in name disambiguation accuracy
Meta-path level attention improves feature weighting and model performance
Semantic and structural integration enhances attribute capturing
Abstract
Name ambiguity is common in academic digital libraries, such as multiple authors having the same name. This creates challenges for academic data management and analysis, thus name disambiguation becomes necessary. The procedure of name disambiguation is to divide publications with the same name into different groups, each group belonging to a unique author. A large amount of attribute information in publications makes traditional methods fall into the quagmire of feature selection. These methods always select attributes artificially and equally, which usually causes a negative impact on accuracy. The proposed method is mainly based on representation learning for heterogeneous networks and clustering and exploits the self-attention technology to solve the problem. The presentation of publications is a synthesis of structural and semantic representations. The structural representation is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Topic Modeling · Biomedical Text Mining and Ontologies
MethodsFeature Selection
