TL;DR
This paper demonstrates that dense phrase retrieval not only excels at fine-grained tasks but also effectively supports coarse-level retrieval like passages and documents, improving accuracy and efficiency.
Contribution
It shows that phrase retrieval can serve as a strong foundation for passage and document retrieval without retraining, and introduces methods to improve efficiency and performance.
Findings
Phrase retrieval outperforms passage retrievers in passage accuracy (+3-5%).
Phrase retrieval enhances end-to-end QA with fewer passages.
Index size reduced by 4-10x using filtering and quantization.
Abstract
Dense retrieval methods have shown great promise over sparse retrieval methods in a range of NLP problems. Among them, dense phrase retrieval-the most fine-grained retrieval unit-is appealing because phrases can be directly used as the output for question answering and slot filling tasks. In this work, we follow the intuition that retrieving phrases naturally entails retrieving larger text blocks and study whether phrase retrieval can serve as the basis for coarse-level retrieval including passages and documents. We first observe that a dense phrase-retrieval system, without any retraining, already achieves better passage retrieval accuracy (+3-5% in top-5 accuracy) compared to passage retrievers, which also helps achieve superior end-to-end QA performance with fewer passages. Then, we provide an interpretation for why phrase-level supervision helps learn better fine-grained entailment…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
