BELHD: Improving Biomedical Entity Linking with Homonoym Disambiguation
Samuele Garda, Ulf Leser

TL;DR
BELHD introduces a novel approach to biomedical entity linking that effectively disambiguates homonyms by preprocessing the knowledge base and sharing candidates, significantly improving accuracy over existing methods.
Contribution
The paper presents BELHD, a new name-based biomedical entity linking method that handles homonyms through KB preprocessing and candidate sharing, outperforming state-of-the-art approaches.
Findings
BELHD achieves an average of 4.55pp recall@1 improvement.
Outperforms in 6 out of 10 corpora.
KB preprocessing benefits other BEL methods like GenBioEL.
Abstract
Biomedical entity linking (BEL) is the task of grounding entity mentions to a knowledge base (KB). A popular approach to the task are name-based methods, i.e. those identifying the most appropriate name in the KB for a given mention, either via dense retrieval or autoregressive modeling. However, as these methods directly return KB names, they cannot cope with homonyms, i.e. different KB entities sharing the exact same name. This significantly affects their performance, especially for KBs where homonyms account for a large amount of entity mentions (e.g. UMLS and NCBI Gene). We therefore present BELHD (Biomedical Entity Linking with Homonym Disambiguation), a new name-based method that copes with this challenge. Specifically, BELHD builds upon the BioSyn (Sung et al.,2020) model introducing two crucial extensions. First, it performs a preprocessing of the KB in which it expands homonyms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Topic Modeling · Natural Language Processing Techniques
MethodsContrastive Learning · Balanced Selection
