Retrieval-Augmented Generation for Large Language Models: A Survey

Yunfan Gao; Yun Xiong; Xinyu Gao; Kangxiang Jia; Jinliu Pan; Yuxi Bi,; Yi Dai; Jiawei Sun; Meng Wang; Haofen Wang

arXiv:2312.10997·cs.CL·March 28, 2024·633 cites

Retrieval-Augmented Generation for Large Language Models: A Survey

Yunfan Gao, Yun Xiong, Xinyu Gao, Kangxiang Jia, Jinliu Pan, Yuxi Bi,, Yi Dai, Jiawei Sun, Meng Wang, Haofen Wang

PDF

Open Access 4 Repos 1 Datasets

TL;DR

This survey comprehensively reviews Retrieval-Augmented Generation (RAG) methods for large language models, highlighting advancements, evaluation benchmarks, and future research directions to address issues like hallucination and knowledge updating.

Contribution

It provides a detailed analysis of RAG paradigms, components, state-of-the-art technologies, and introduces new evaluation frameworks and benchmarks.

Findings

01

RAG improves LLM accuracy and credibility.

02

New evaluation benchmarks for RAG systems.

03

Identification of challenges and future research directions.

Abstract

Large Language Models (LLMs) showcase impressive capabilities but encounter challenges like hallucination, outdated knowledge, and non-transparent, untraceable reasoning processes. Retrieval-Augmented Generation (RAG) has emerged as a promising solution by incorporating knowledge from external databases. This enhances the accuracy and credibility of the generation, particularly for knowledge-intensive tasks, and allows for continuous knowledge updates and integration of domain-specific information. RAG synergistically merges LLMs' intrinsic knowledge with the vast, dynamic repositories of external databases. This comprehensive review paper offers a detailed examination of the progression of RAG paradigms, encompassing the Naive RAG, the Advanced RAG, and the Modular RAG. It meticulously scrutinizes the tripartite foundation of RAG frameworks, which includes the retrieval, the generation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Datasets

BAAI/SurveyScope
dataset· 6 dl
6 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques

MethodsMulti-Head Attention · Attention Is All You Need · WordPiece · Linear Layer · Byte Pair Encoding · Dense Connections · Adam · Linear Warmup With Linear Decay · Attention Dropout · Residual Connection