FedRAG: A Framework for Fine-Tuning Retrieval-Augmented Generation Systems

Val Andrei Fajardo; David B. Emerson; Amandeep Singh; Veronica Chatrath; Marcelo Lotif; Ravi Theja; Alex Cheung; Izuki Matsuba

arXiv:2506.09200·cs.LG·June 13, 2025

FedRAG: A Framework for Fine-Tuning Retrieval-Augmented Generation Systems

Val Andrei Fajardo, David B. Emerson, Amandeep Singh, Veronica Chatrath, Marcelo Lotif, Ravi Theja, Alex Cheung, Izuki Matsuba

PDF

Open Access 1 Repo

TL;DR

FedRAG is a versatile framework that enables effective fine-tuning of retrieval-augmented generation systems in both centralized and federated settings, enhancing their adaptability and performance.

Contribution

It introduces a comprehensive framework supporting state-of-the-art fine-tuning methods for RAG systems, with seamless transition between centralized and federated training.

Findings

01

Supports state-of-the-art fine-tuning methods

02

Enables seamless conversion from centralized to federated training

03

Deep integration with the modern RAG ecosystem

Abstract

Retrieval-augmented generation (RAG) systems have been shown to be effective in addressing many of the drawbacks of relying solely on the parametric memory of large language models. Recent work has demonstrated that RAG systems can be improved via fine-tuning of their retriever and generator models. In this work, we introduce FedRAG, a framework for fine-tuning RAG systems across centralized and federated architectures. FedRAG supports state-of-the-art fine-tuning methods, offering a simple and intuitive interface and a seamless conversion from centralized to federated training tasks. FedRAG is also deeply integrated with the modern RAG ecosystem, filling a critical gap in available tools.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vectorinstitute/fed-rag
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsLayer Normalization · Linear Warmup With Linear Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Attention Dropout · Byte Pair Encoding · Softmax · Linear Layer · Dropout · Dense Connections · Attention Is All You Need