Discovering Entities with Just a Little Help from You
Jaspreet Singh, Johannes Hoffart, Avishek Anand

TL;DR
This paper introduces a human-in-the-loop approach for entity linking that improves disambiguation of long-tail and emerging entities by involving user feedback to generate better candidate suggestions, especially when automated methods fall short.
Contribution
It presents novel gradient interleaving retrieval methods that incorporate user feedback to enhance candidate generation for entity linking, addressing the challenge of long-tail and emerging entities.
Findings
Outperforms baseline methods in intrinsic and extrinsic evaluations.
Engages users effectively in the disambiguation process.
Improves linking accuracy for long-tail and emerging entities.
Abstract
Linking entities like people, organizations, books, music groups and their songs in text to knowledge bases (KBs) is a fundamental task for many downstream search and mining applications. Achieving high disambiguation accuracy crucially depends on a rich and holistic representation of the entities in the KB. For popular entities, such a representation can be easily mined from Wikipedia, and many current entity disambiguation and linking methods make use of this fact. However, Wikipedia does not contain long-tail entities that only few people are interested in, and also at times lags behind until newly emerging entities are added. For such entities, mining a suitable representation in a fully automated fashion is very difficult, resulting in poor linking accuracy. What can automatically be mined, though, is a high-quality representation given the context of a new entity occurring in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
