Learning Dense Representations for Entity Retrieval

Daniel Gillick; Sayali Kulkarni; Larry Lansing; Alessandro Presta,; Jason Baldridge; Eugene Ie; Diego Garcia-Olano

arXiv:1909.10506·cs.CL·September 24, 2019

Learning Dense Representations for Entity Retrieval

Daniel Gillick, Sayali Kulkarni, Larry Lansing, Alessandro Presta,, Jason Baldridge, Eugene Ie, Diego Garcia-Olano

PDF

TL;DR

This paper introduces a fully learned entity retrieval model using a dual encoder that encodes mentions and entities in the same dense space, enabling fast retrieval without alias tables and outperforming traditional baselines.

Contribution

It presents the first fully learned entity retrieval system based on dual encoders trained solely on Wikipedia anchor links, improving over previous methods.

Findings

01

Outperforms alias table and BM25 baselines.

02

Achieves competitive results on TACKBP-2010 dataset.

03

Retrieves candidates extremely fast and generalizes well.

Abstract

We show that it is feasible to perform entity linking by training a dual encoder (two-tower) model that encodes mentions and entities in the same dense vector space, where candidate entities are retrieved by approximate nearest neighbor search. Unlike prior work, this setup does not rely on an alias table followed by a re-ranker, and is thus the first fully learned entity retrieval model. We show that our dual encoder, trained using only anchor-text links in Wikipedia, outperforms discrete alias table and BM25 baselines, and is competitive with the best comparable results on the standard TACKBP-2010 dataset. In addition, it can retrieve candidates extremely fast, and generalizes well to a new dataset derived from Wikinews. On the modeling side, we demonstrate the dramatic value of an unsupervised negative mining algorithm for this task.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.