Leveraging Passage Retrieval with Generative Models for Open Domain Question Answering

Gautier Izacard; Edouard Grave

arXiv:2007.01282·cs.CL·February 4, 2021

TL;DR

This paper investigates how much generative models for open domain question answering benefit from retrieving text passages. The approach reaches state-of-the-art results on Natural Questions and TriviaQA, and its performance improves as the number of retrieved passages increases.

Contribution

The paper shows that retrieving multiple passages significantly improves the accuracy of generative models in open domain QA, which indicates that these models can aggregate evidence across passages.

Findings

01 State-of-the-art results on Natural Questions and TriviaQA.

02 Performance improves with more retrieved passages.

03 Generative models effectively combine evidence from multiple sources.

Abstract

Generative models for open domain question answering have proven to be competitive, without resorting to external knowledge. While promising, this approach requires using models with billions of parameters, which are expensive to train and query. In this paper, we investigate how much these models can benefit from retrieving text passages, potentially containing evidence. We obtain state-of-the-art results on the Natural Questions and TriviaQA open benchmarks. Interestingly, we observe that the performance of this method significantly improves when increasing the number of retrieved passages. This is evidence that generative models are good at aggregating and combining evidence from multiple passages.
