Beyond Semantic Relevance: Counterfactual Risk Minimization for Robust Retrieval-Augmented Generation

Peiyang Liu; Qiang Yan; Ziqiang Cui; Di Liang; Xi Wang; Wei Ye

arXiv:2605.01302·cs.CL·May 5, 2026

TL;DR

This paper introduces CoRM-RAG, a retrieval framework that prioritizes decision safety over semantic relevance, improving robustness against user biases and adversarial queries in retrieval-augmented generation systems.

Contribution

The paper proposes a training protocol based on causal intervention, together with a lightweight Evidence Critic, to make retrieval in RAG systems more robust and risk-aware.

Findings

01. CoRM-RAG outperforms existing retrievers in adversarial settings.

02. The Evidence Critic effectively identifies evidentially strong documents.

03. The framework enables risk-aware abstention for safer decision-making.

Abstract

Standard Retrieval-Augmented Generation (RAG) systems predominantly rely on semantic relevance as a proxy for utility. However, this assumption collapses in realistic decision-making scenarios where user queries are laden with cognitive biases, such as false premises or confirmation bias. In such cases, maximizing relevance paradoxically promotes the retrieval of sycophantic evidence that reinforces hallucinations, a critical failure we term the "Relevance-Robustness Gap". To bridge this gap, we propose CoRM-RAG (Counterfactual Risk Minimization for RAG), a framework that aligns retrieval with decision safety rather than mere similarity. Grounded in causal intervention, we introduce a Cognitive Perturbation Protocol to simulate user biases during training, which is then distilled into a lightweight Evidence Critic. This scoring module learns to identify documents that possess…
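To make the idea of critic-based selection with abstention concrete, here is a minimal Python sketch. It is not the authors' implementation: the names `Candidate`, `select_evidence`, `toy_critic`, the threshold `min_score`, and the toy scores are all hypothetical, and the critic is a stand-in lookup rather than a trained model. It only illustrates how a pipeline might rank by a critic score instead of semantic relevance and decline to answer when no document scores high enough.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Candidate:
    text: str
    relevance: float  # first-stage semantic similarity (hypothetical, in [0, 1])


def select_evidence(
    candidates: List[Candidate],
    critic: Callable[[str], float],
    min_score: float = 0.5,  # hypothetical abstention threshold
    top_k: int = 2,
) -> Optional[List[Candidate]]:
    """Rank candidates by critic score; return None (abstain) if none pass min_score."""
    scored = sorted(
        ((critic(c.text), c) for c in candidates),
        key=lambda pair: pair[0],
        reverse=True,
    )
    kept = [c for score, c in scored if score >= min_score][:top_k]
    return kept or None


# Stand-in critic: a fixed lookup table, NOT a trained Evidence Critic.
toy_scores = {"doc_a": 0.2, "doc_b": 0.9}


def toy_critic(text: str) -> float:
    return toy_scores.get(text, 0.0)


pool = [Candidate("doc_a", relevance=0.95), Candidate("doc_b", relevance=0.60)]
chosen = select_evidence(pool, toy_critic)
abstained = select_evidence([Candidate("doc_a", relevance=0.95)], toy_critic)
```

In this toy setup `doc_a` has the higher semantic relevance but a low critic score, so only `doc_b` is kept. When `doc_a` is the sole candidate the function returns `None`, which a caller could treat as a signal to abstain.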

Code & Models

Repositories

GitHub: PeiYangLiu/CoRM-RAG.git

Models

Hugging Face (🤗): PeiyangLiu/CoRM-RAG
