Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, Hannaneh Hajishirzi

TL;DR
Self-RAG is a novel framework that improves large language models by enabling adaptive retrieval and self-reflection, leading to more accurate and controllable responses across diverse tasks.
Contribution
It introduces Self-RAG, a unified model that adaptively retrieves information and uses self-reflection tokens to enhance factuality and task adaptability.
Findings
Self-RAG outperforms state-of-the-art LLMs and retrieval-augmented models.
It improves factuality and citation accuracy in long-form generation.
Self-RAG surpasses ChatGPT and Llama2-chat on various benchmarks.
Abstract
Despite their remarkable capabilities, large language models (LLMs) often produce responses containing factual inaccuracies due to their sole reliance on the parametric knowledge they encapsulate. Retrieval-Augmented Generation (RAG), an ad hoc approach that augments LMs with retrieval of relevant knowledge, decreases such issues. However, indiscriminately retrieving and incorporating a fixed number of retrieved passages, regardless of whether retrieval is necessary, or passages are relevant, diminishes LM versatility or can lead to unhelpful response generation. We introduce a new framework called Self-Reflective Retrieval-Augmented Generation (Self-RAG) that enhances an LM's quality and factuality through retrieval and self-reflection. Our framework trains a single arbitrary LM that adaptively retrieves passages on-demand, and generates and reflects on retrieved passages and its own…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗selfrag/selfrag_llama2_7bmodel· 2.5k dl· ♡ 822.5k dl♡ 82
- 🤗selfrag/selfrag_llama2_13bmodel· 122 dl· ♡ 61122 dl♡ 61
- 🤗devilteo911/selfrag_llama2_7b-q8_0model· 5 dl· ♡ 25 dl♡ 2
- 🤗SciPhi/SciPhi-Self-RAG-Mistral-7B-32kmodel· 988 dl· ♡ 90988 dl♡ 90
- 🤗TheBloke/SciPhi-Self-RAG-Mistral-7B-32k-GGUFmodel· 434 dl· ♡ 22434 dl♡ 22
- 🤗TheBloke/SciPhi-Self-RAG-Mistral-7B-32k-AWQmodel· 6 dl· ♡ 56 dl♡ 5
- 🤗TheBloke/SciPhi-Self-RAG-Mistral-7B-32k-GPTQmodel· 10 dl· ♡ 210 dl♡ 2
- 🤗LoneStriker/SciPhi-Self-RAG-Mistral-7B-32k-3.0bpw-h6-exl2model· 1 dl1 dl
- 🤗LoneStriker/SciPhi-Self-RAG-Mistral-7B-32k-4.0bpw-h6-exl2model· 2 dl2 dl
- 🤗LoneStriker/SciPhi-Self-RAG-Mistral-7B-32k-5.0bpw-h6-exl2model· 1 dl1 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsSparse Evolutionary Training · High-Order Consensuses
