Entity Synonym Discovery via Multipiece Bilateral Context Matching
Chenwei Zhang, Yaliang Li, Nan Du, Wei Fan, Philip S. Yu

TL;DR
This paper introduces SYNONYMNET, a neural network model that leverages multiple context pieces to accurately discover synonymous entities across diverse datasets, improving robustness and effectiveness over existing methods.
Contribution
It generalizes the distributional hypothesis to a multi-context setting and proposes a bilateral matching framework for synonym detection using free-text corpora.
Findings
Achieves up to 4.16% AUC improvement over baselines.
Effective on both generic and domain-specific datasets.
Detects unseen synonym sets during training.
Abstract
Being able to automatically discover synonymous entities in an open-world setting benefits various tasks such as entity disambiguation or knowledge graph canonicalization. Existing works either only utilize entity features, or rely on structured annotations from a single piece of context where the entity is mentioned. To leverage diverse contexts where entities are mentioned, in this paper, we generalize the distributional hypothesis to a multi-context setting and propose a synonym discovery framework that detects entity synonyms from free-text corpora with considerations on effectiveness and robustness. As one of the key components in synonym discovery, we introduce a neural network model SYNONYMNET to determine whether or not two given entities are synonym with each other. Instead of using entities features, SYNONYMNET makes use of multiple pieces of contexts in which the entity is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies
