TL;DR
Stable-RAG introduces a method to reduce hallucinations caused by retrieval order sensitivity in retrieval-augmented generation, improving consistency and accuracy.
Contribution
It proposes Stable-RAG, a novel approach that mitigates permutation-induced hallucinations by aggregating reasoning across multiple retrieval orders.
Findings
Stable-RAG improves answer accuracy on three QA datasets.
It enhances reasoning consistency and generalization.
The method outperforms strong baselines in experiments.
Abstract
Retrieval-Augmented Generation (RAG) has become a key paradigm for reducing factual hallucinations in Large Language Models (LLMs), yet little is known about how the order of retrieved documents affects model behavior. We empirically show that under a Top-5 retrieval setting with the gold document included, LLM answers vary substantially across permutations of the retrieved set, even when the gold document is fixed in the first position. This reveals a previously underexplored sensitivity to retrieval permutations. Although existing robust RAG methods focus primarily on enhancing LLM robustness to low-quality retrieval and mitigating positional bias to distribute attention fairly over long contexts, neither approach directly addresses permutation sensitivity. In this paper, we propose Stable-RAG, which exploits permutation sensitivity estimation to mitigate permutation-induced…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
