A Comprehensive Survey of Retrieval-Augmented Generation (RAG):   Evolution, Current Landscape and Future Directions

Shailja Gupta; Rajesh Ranjan; Surya Narayan Singh

arXiv:2410.12837·cs.CL·October 18, 2024·20 cites

A Comprehensive Survey of Retrieval-Augmented Generation (RAG): Evolution, Current Landscape and Future Directions

Shailja Gupta, Rajesh Ranjan, Surya Narayan Singh

PDF

Open Access

TL;DR

This survey comprehensively reviews Retrieval-Augmented Generation (RAG), detailing its evolution, current state, technological advancements, challenges, and future research directions in natural language processing.

Contribution

It provides a detailed overview of RAG's architecture, innovations, applications, and challenges, serving as a foundational resource for future research and development.

Findings

01

RAG enhances language model accuracy through retrieval mechanisms.

02

Recent advances improve retrieval efficiency and application scope.

03

Challenges include scalability, bias, and ethical concerns.

Abstract

This paper presents a comprehensive study of Retrieval-Augmented Generation (RAG), tracing its evolution from foundational concepts to the current state of the art. RAG combines retrieval mechanisms with generative language models to enhance the accuracy of outputs, addressing key limitations of LLMs. The study explores the basic architecture of RAG, focusing on how retrieval and generation are integrated to handle knowledge-intensive tasks. A detailed review of the significant technological advancements in RAG is provided, including key innovations in retrieval-augmented language models and applications across various domains such as question-answering, summarization, and knowledge-based tasks. Recent research breakthroughs are discussed, highlighting novel methods for improving retrieval efficiency. Furthermore, the paper examines ongoing challenges such as scalability, bias, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Algorithms and Data Compression

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Linear Layer · Byte Pair Encoding · Softmax · Multi-Head Attention · WordPiece · Dropout · Layer Normalization · Adam · Attention Dropout