Learning Dense Representations of Phrases at Scale

Jinhyuk Lee; Mujeen Sung; Jaewoo Kang; Danqi Chen

arXiv:2012.12624 · cs.CL · June 3, 2021 · 5 cites

TL;DR

This paper introduces DensePhrases, a method for learning dense phrase representations that significantly improves open-domain question answering accuracy and efficiency, enabling fast retrieval and downstream task application.

Contribution

The paper presents a method for learning dense phrase representations from reading comprehension supervision. It outperforms previous phrase retrieval models, which depend heavily on sparse representations, and enables scalable, fast retrieval.

Findings

01

DensePhrases improves over previous phrase retrieval models by 15%-25% absolute on five popular open-domain QA datasets.

02

The model processes over 10 questions per second on CPUs.

03

Dense representations are effective for downstream slot filling tasks.

Abstract

Open-domain question answering can be reformulated as a phrase retrieval problem, without the need for processing documents on-demand during inference (Seo et al., 2019). However, current phrase retrieval models heavily depend on sparse representations and still underperform retriever-reader approaches. In this work, we show for the first time that we can learn dense representations of phrases alone that achieve much stronger performance in open-domain QA. We present an effective method to learn phrase representations from the supervision of reading comprehension tasks, coupled with novel negative sampling methods. We also propose a query-side fine-tuning strategy, which can support transfer learning and reduce the discrepancy between training and inference. On five popular open-domain QA datasets, our model DensePhrases improves over previous phrase retrieval models by 15%-25% absolute…
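
To make the phrase retrieval framing concrete, here is a minimal sketch of answering a question by inner-product search over precomputed phrase vectors. It is only an illustration under stated assumptions: the vectors are hand-written toy values, and the names `inner_product`, `top_k_phrases`, and `phrase_index` are hypothetical, not taken from the paper or its code. A real system would use learned phrase and question encoders and a much larger index.

```python
import heapq

def inner_product(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def top_k_phrases(query_vec, phrase_index, k=2):
    """Return the k (score, phrase) pairs with the largest inner product.

    phrase_index is a list of (phrase_text, phrase_vector) pairs that
    were encoded ahead of time, so no document is read at query time.
    """
    scored = (
        (inner_product(query_vec, vec), phrase)
        for phrase, vec in phrase_index
    )
    return heapq.nlargest(k, scored)

# Toy, hand-made phrase vectors (assumption: stand-ins for encoder outputs).
phrase_index = [
    ("Barack Obama", [1.0, 0.0, 0.0]),
    ("1961",         [0.0, 1.0, 0.0]),
    ("Honolulu",     [0.0, 0.0, 1.0]),
    ("Hawaii",       [0.0, 0.1, 0.9]),
]

# Toy question vector (assumption: stand-in for an encoded question).
query_vec = [0.0, 0.2, 1.0]

results = top_k_phrases(query_vec, phrase_index, k=2)
for score, phrase in results:
    print(f"{phrase}: {score:.2f}")
```

Because the phrase vectors are fixed ahead of time, only the question has to be encoded at inference, which is what makes the setup compatible with the query-side fine-tuning the abstract describes.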

Code & Models

Models

🤗 softdev629/scms-demo (model)

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Information Retrieval and Search Behavior