Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

Patrick Lewis; Ethan Perez; Aleksandra Piktus; Fabio Petroni; Vladimir Karpukhin; Naman Goyal; Heinrich Küttler; Mike Lewis; Wen-tau Yih; Tim Rocktäschel; Sebastian Riedel; Douwe Kiela

arXiv:2005.11401 · cs.CL · April 13, 2021

TL;DR

This paper introduces retrieval-augmented generation (RAG) models that combine parametric and non-parametric memory to improve performance on knowledge-intensive NLP tasks, achieving state-of-the-art results and more factual, diverse language generation.

Contribution

The paper presents a general-purpose fine-tuning approach for RAG models that integrate pre-trained seq2seq models with a dense Wikipedia index, outperforming existing methods on several tasks.
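To make the idea of combining a parametric generator with a non-parametric index concrete, here is a minimal toy sketch. It assumes one common way of combining the two: retrieve the top-k passages by inner product, turn their scores into a distribution p(z|x), and average the generator's probability p(y|x,z) under that distribution. The vectors, the `toy_generator_prob` lookup table, and all function names are hypothetical stand-ins for illustration, not the authors' code or any library's API.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def retrieve(query_vec, doc_vecs, k):
    """Return [(doc_index, p(z|x)), ...] for the top-k docs by inner product."""
    scores = [sum(q * d for q, d in zip(query_vec, vec)) for vec in doc_vecs]
    top = sorted(range(len(doc_vecs)), key=lambda i: scores[i], reverse=True)[:k]
    probs = softmax([scores[i] for i in top])  # renormalised over the top-k only
    return list(zip(top, probs))


def marginal_likelihood(query_vec, doc_vecs, generator_prob, answer, k=2):
    """Approximate p(y|x) as the sum over top-k z of p(z|x) * p(y|x,z)."""
    return sum(
        p_z * generator_prob(answer, z)
        for z, p_z in retrieve(query_vec, doc_vecs, k)
    )


# Hypothetical dense index: three passages embedded in two dimensions.
DOC_VECS = [[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]]


def toy_generator_prob(answer, doc_index):
    """Stand-in for a seq2seq model's p(answer | question, passage)."""
    table = {0: {"Paris": 0.9}, 1: {"Paris": 0.1}, 2: {"Paris": 0.5}}
    return table[doc_index].get(answer, 0.0)


retrieved = retrieve([1.0, 0.2], DOC_VECS, k=2)
p = marginal_likelihood([1.0, 0.2], DOC_VECS, toy_generator_prob, "Paris", k=2)
print(retrieved, round(p, 3))
```

In this toy setup the passage that best matches the query contributes most to the final probability, which is the sense in which the retrieved text, rather than the generator's weights alone, carries the knowledge. A real system would replace the hand-written vectors with a learned dense encoder and the lookup table with a pre-trained seq2seq model.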

Findings

01 RAG models set a new state of the art on three open-domain QA tasks.

02 RAG models produce more specific and diverse language than parametric-only models.

03 Different retrieval strategies affect the quality and diversity of generated text.

Abstract

Large pre-trained language models have been shown to store factual knowledge in their parameters, and achieve state-of-the-art results when fine-tuned on downstream NLP tasks. However, their ability to access and precisely manipulate knowledge is still limited, and hence on knowledge-intensive tasks, their performance lags behind task-specific architectures. Additionally, providing provenance for their decisions and updating their world knowledge remain open research problems. Pre-trained models with a differentiable access mechanism to explicit non-parametric memory can overcome this issue, but have so far been only investigated for extractive downstream tasks. We explore a general-purpose fine-tuning recipe for retrieval-augmented generation (RAG) -- models which combine pre-trained parametric and non-parametric memory for language generation. We introduce RAG models where the…

Code & Models

Datasets

alpha-one-index/awesome-ai-index
dataset· 167 dl

Videos

AI in 2024 - efficiency over model size (Nick Jakobi)· youtube

#100 Dr. PATRICK LEWIS - Retrieval Augmented Generation· youtube

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks· slideslive

Taxonomy

Methods: Multi-Head Attention · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Adam · Residual Connection · Dense Connections · Linear Warmup With Linear Decay · Weight Decay