Self-RAG: Learning to Retrieve, Generate, and Critique through   Self-Reflection

Akari Asai; Zeqiu Wu; Yizhong Wang; Avirup Sil; Hannaneh Hajishirzi

arXiv:2310.11511·cs.CL·October 19, 2023·86 cites

Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, Hannaneh Hajishirzi

PDF

Open Access 5 Repos 10 Models 2 Datasets

TL;DR

Self-RAG is a novel framework that improves large language models by enabling adaptive retrieval and self-reflection, leading to more accurate and controllable responses across diverse tasks.

Contribution

It introduces Self-RAG, a unified model that adaptively retrieves information and uses self-reflection tokens to enhance factuality and task adaptability.

Findings

01

Self-RAG outperforms state-of-the-art LLMs and retrieval-augmented models.

02

It improves factuality and citation accuracy in long-form generation.

03

Self-RAG surpasses ChatGPT and Llama2-chat on various benchmarks.

Abstract

Despite their remarkable capabilities, large language models (LLMs) often produce responses containing factual inaccuracies due to their sole reliance on the parametric knowledge they encapsulate. Retrieval-Augmented Generation (RAG), an ad hoc approach that augments LMs with retrieval of relevant knowledge, decreases such issues. However, indiscriminately retrieving and incorporating a fixed number of retrieved passages, regardless of whether retrieval is necessary, or passages are relevant, diminishes LM versatility or can lead to unhelpful response generation. We introduce a new framework called Self-Reflective Retrieval-Augmented Generation (Self-RAG) that enhances an LM's quality and factuality through retrieval and self-reflection. Our framework trains a single arbitrary LM that adaptively retrieves passages on-demand, and generates and reflects on retrieved passages and its own…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsSparse Evolutionary Training · High-Order Consensuses