Phrase Retrieval Learns Passage Retrieval, Too

Jinhyuk Lee; Alexander Wettig; Danqi Chen

arXiv:2109.08133·cs.CL·September 17, 2021

Phrase Retrieval Learns Passage Retrieval, Too

Jinhyuk Lee, Alexander Wettig, Danqi Chen

PDF

1 Repo

TL;DR

This paper demonstrates that dense phrase retrieval not only excels at fine-grained tasks but also effectively supports coarse-level retrieval like passages and documents, improving accuracy and efficiency.

Contribution

It shows that phrase retrieval can serve as a strong foundation for passage and document retrieval without retraining, and introduces methods to improve efficiency and performance.

Findings

01

Phrase retrieval outperforms passage retrievers in passage accuracy (+3-5%).

02

Phrase retrieval enhances end-to-end QA with fewer passages.

03

Index size reduced by 4-10x using filtering and quantization.

Abstract

Dense retrieval methods have shown great promise over sparse retrieval methods in a range of NLP problems. Among them, dense phrase retrieval-the most fine-grained retrieval unit-is appealing because phrases can be directly used as the output for question answering and slot filling tasks. In this work, we follow the intuition that retrieving phrases naturally entails retrieving larger text blocks and study whether phrase retrieval can serve as the basis for coarse-level retrieval including passages and documents. We first observe that a dense phrase-retrieval system, without any retraining, already achieves better passage retrieval accuracy (+3-5% in top-5 accuracy) compared to passage retrievers, which also helps achieve superior end-to-end QA performance with fewer passages. Then, we provide an interpretation for why phrase-level supervision helps learn better fine-grained entailment…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

princeton-nlp/DensePhrases
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.