RbFT: Robust Fine-tuning for Retrieval-Augmented Generation against Retrieval Defects
Yiteng Tu, Weihang Su, Yujia Zhou, Yiqun Liu, Qingyao Ai

TL;DR
This paper introduces RbFT, a fine-tuning approach that significantly improves the robustness of retrieval-augmented generation systems against retrieval errors and noise, ensuring more reliable language model outputs.
Contribution
The paper proposes RbFT, a novel fine-tuning method that enhances LLM resilience to retrieval defects, outperforming existing approaches in robustness while maintaining efficiency.
Findings
RbFT improves robustness across various retrieval conditions.
It surpasses existing methods in handling retrieval noise.
Maintains high inference efficiency and compatibility.
Abstract
Retrieval-augmented generation (RAG) enhances large language models (LLMs) by integrating external knowledge retrieved from a knowledge base. However, its effectiveness is fundamentally constrained by the reliability of both the retriever and the knowledge base. In real-world scenarios, imperfections in these components often lead to the retrieval of noisy, irrelevant, or misleading counterfactual information, ultimately undermining the trustworthiness of RAG systems. To address this challenge, we propose Robust Fine-Tuning (RbFT), a method designed to enhance the resilience of LLMs against retrieval defects through two targeted fine-tuning tasks. Experimental results demonstrate that RbFT significantly improves the robustness of RAG systems across diverse retrieval conditions, surpassing existing methods while maintaining high inference efficiency and compatibility with other…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Data Compression Techniques · Speech and Audio Processing
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Layer Normalization · Linear Warmup With Linear Decay · Adam · Softmax · Dropout · Byte Pair Encoding · Residual Connection · WordPiece
