A Systematic Literature Review of Retrieval-Augmented Generation: Techniques, Metrics, and Challenges

Andrew Brown; Muhammad Roman; Barry Devereux

arXiv:2508.06401·cs.DL·September 10, 2025·3 cites

A Systematic Literature Review of Retrieval-Augmented Generation: Techniques, Metrics, and Challenges

Andrew Brown, Muhammad Roman, Barry Devereux

PDF

Open Access

TL;DR

This systematic review analyzes highly cited research on retrieval-augmented generation (RAG), summarizing techniques, metrics, and challenges, and providing insights into its effectiveness, limitations, and future research directions.

Contribution

It offers a comprehensive synthesis of recent RAG studies, highlighting methodological gaps and proposing future research priorities.

Findings

01

RAG effectively grounds language models in up-to-date information

02

Identifies key datasets, architectures, and evaluation practices in RAG research

03

Highlights limitations and challenges in current RAG methodologies

Abstract

This systematic review of the research literature on retrieval-augmented generation (RAG) provides a focused analysis of the most highly cited studies published between 2020 and May 2025. A total of 128 articles met our inclusion criteria. The records were retrieved from ACM Digital Library, IEEE Xplore, Scopus, ScienceDirect, and the Digital Bibliography and Library Project (DBLP). RAG couples a neural retriever with a generative language model, grounding output in up-to-date, non-parametric memory while retaining the semantic generalisation stored in model weights. Guided by the PRISMA 2020 framework, we (i) specify explicit inclusion and exclusion criteria based on citation count and research questions, (ii) catalogue datasets, architectures, and evaluation practices, and (iii) synthesise empirical evidence on the effectiveness and limitations of RAG. To mitigate citation-lag bias,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling