On the Diminishing Returns of Complex Robust RAG Training in the Era of Powerful LLMs

Hanxing Ding; Shuchang Tao; Liang Pang; Zihao Wei; Liwei Chen; Kun Xu; Huawei Shen; Xueqi Cheng

arXiv:2502.11400·cs.CL·October 6, 2025

On the Diminishing Returns of Complex Robust RAG Training in the Era of Powerful LLMs

Hanxing Ding, Shuchang Tao, Liang Pang, Zihao Wei, Liwei Chen, Kun Xu, Huawei Shen, Xueqi Cheng

PDF

Open Access

TL;DR

As language models grow more powerful, the advantages of complex robust training strategies in retrieval-augmented generation diminish, with simpler methods often achieving comparable performance on QA tasks.

Contribution

This work systematically evaluates the diminishing returns of complex robust training in RAG systems across model scales, highlighting the sufficiency of simpler approaches for large models.

Findings

01

Robustness gains decrease with larger models

02

Simpler training approaches perform well on powerful models

03

Stronger models show better calibration and generalization

Abstract

Retrieval-augmented generation (RAG) systems traditionally employ sophisticated training strategies to enhance robustness against retrieval noise. In this work, we investigate a critical question: does the benefit of these complex robust training methods diminish as language models become more powerful? Through systematic evaluation across multiple model scales and question-answering datasets, our analysis reveals a consistent trend: \emph{the marginal robustness benefit of sophisticated training strategies decreases substantially as model capacity increases.} While smaller models show significant performance improvements from complex document selection and adversarial objectives, more capable models achieve comparable or even superior performance with simpler training approaches. Further investigation demonstrates that stronger models naturally exhibit better confidence calibration,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsQuality and Safety in Healthcare · Biomedical and Engineering Education

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Byte Pair Encoding · Adam · Softmax · Dropout · Weight Decay · BART · WordPiece · Layer Normalization