MOLEMAN: Mention-Only Linking of Entities with a Mention Annotation Network
Nicholas FitzGerald, Jan A. Botha, Daniel Gillick, Daniel M. Bikel,, Tom Kwiatkowski, Andrew McCallum

TL;DR
MOLEMAN introduces a mention-only entity linking method that uses a contextualized mention encoder and nearest neighbor retrieval, outperforming existing systems on multilingual benchmarks.
Contribution
It proposes a novel mention-based entity linking approach with a contextualized mention encoder and large-scale nearest neighbor retrieval, improving interpretability and performance.
Findings
Outperforms existing systems on multilingual benchmarks.
Uses a large index of 700 million mentions for retrieval.
Simpler training process with more interpretable predictions.
Abstract
We present an instance-based nearest neighbor approach to entity linking. In contrast to most prior entity retrieval systems which represent each entity with a single vector, we build a contextualized mention-encoder that learns to place similar mentions of the same entity closer in vector space than mentions of different entities. This approach allows all mentions of an entity to serve as "class prototypes" as inference involves retrieving from the full set of labeled entity mentions in the training set and applying the nearest mention neighbor's entity label. Our model is trained on a large multilingual corpus of mention pairs derived from Wikipedia hyperlinks, and performs nearest neighbor inference on an index of 700 million mentions. It is simpler to train, gives more interpretable predictions, and outperforms all other systems on two multilingual entity linking benchmarks.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
