Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
Patrick Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir, Karpukhin, Naman Goyal, Heinrich K\"uttler, Mike Lewis, Wen-tau Yih, Tim, Rockt\"aschel, Sebastian Riedel, Douwe Kiela

TL;DR
This paper introduces retrieval-augmented generation (RAG) models that combine parametric and non-parametric memory to improve performance on knowledge-intensive NLP tasks, achieving state-of-the-art results and more factual, diverse language generation.
Contribution
The paper presents a general-purpose fine-tuning approach for RAG models that integrate pre-trained seq2seq models with a dense Wikipedia index, outperforming existing methods on several tasks.
Findings
RAG models set new state-of-the-art on three open domain QA tasks.
RAG models produce more specific and diverse language than parametric-only models.
Different retrieval strategies impact the quality and diversity of generated text.
Abstract
Large pre-trained language models have been shown to store factual knowledge in their parameters, and achieve state-of-the-art results when fine-tuned on downstream NLP tasks. However, their ability to access and precisely manipulate knowledge is still limited, and hence on knowledge-intensive tasks, their performance lags behind task-specific architectures. Additionally, providing provenance for their decisions and updating their world knowledge remain open research problems. Pre-trained models with a differentiable access mechanism to explicit non-parametric memory can overcome this issue, but have so far been only investigated for extractive downstream tasks. We explore a general-purpose fine-tuning recipe for retrieval-augmented generation (RAG) -- models which combine pre-trained parametric and non-parametric memory for language generation. We introduce RAG models where the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗facebook/rag-sequence-basemodel· 178 dl· ♡ 10178 dl♡ 10
- 🤗facebook/rag-sequence-nqmodel· 10k dl· ♡ 4310k dl♡ 43
- 🤗facebook/rag-token-basemodel· 409 dl· ♡ 18409 dl♡ 18
- 🤗facebook/rag-token-nqmodel· 5.1k dl· ♡ 1785.1k dl♡ 178
- 🤗coniferlabs/flan-ul2-dolly-loramodel· ♡ 12♡ 12
- 🤗HuggingWorm/RagRetrievermodel
- 🤗hauson-fan/RagRetrievermodel· 2 dl2 dl
- 🤗seritwin/Virginia-food-deliverymodel
Videos
Taxonomy
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Adam · Residual Connection · Dense Connections · Linear Warmup With Linear Decay · Weight Decay
