Federated Retrieval-Augmented Generation: A Systematic Mapping Study

Abhijit Chakraborty; Chahana Dahal; Vivek Gupta

arXiv:2505.18906·cs.CL·September 3, 2025

Federated Retrieval-Augmented Generation: A Systematic Mapping Study

Abhijit Chakraborty, Chahana Dahal, Vivek Gupta

PDF

Open Access

TL;DR

This paper systematically maps the emerging field of Federated Retrieval-Augmented Generation, analyzing its research trends, architectural patterns, and key challenges to guide future developments in privacy-preserving NLP.

Contribution

It is the first comprehensive systematic mapping study of Federated RAG, classifying research focuses, contributions, and identifying open challenges in the field.

Findings

01

Identifies key architectural patterns and trends in Federated RAG research.

02

Highlights privacy, heterogeneity, and evaluation as main challenges.

03

Provides a structured overview to guide future research directions.

Abstract

Federated Retrieval-Augmented Generation (Federated RAG) combines Federated Learning (FL), which enables distributed model training without exposing raw data, with Retrieval-Augmented Generation (RAG), which improves the factual accuracy of language models by grounding outputs in external knowledge. As large language models are increasingly deployed in privacy-sensitive domains such as healthcare, finance, and personalized assistance, Federated RAG offers a promising framework for secure, knowledge-intensive natural language processing (NLP). To the best of our knowledge, this paper presents the first systematic mapping study of Federated RAG, covering literature published between 2020 and 2025. Following Kitchenham's guidelines for evidence-based software engineering, we develop a structured classification of research focuses, contribution types, and application domains. We analyze…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Data Storage Technologies · Data Quality and Management

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Attention Dropout · Softmax · WordPiece · Weight Decay · Multi-Head Attention · Layer Normalization · Byte Pair Encoding