GRACE: Reinforcement Learning for Grounded Response and Abstention under Contextual Evidence

Yibo Zhao; Jiapeng Zhu; Zichen Ding; Xiang Li

arXiv:2601.04525·cs.CL·January 9, 2026

GRACE: Reinforcement Learning for Grounded Response and Abstention under Contextual Evidence

Yibo Zhao, Jiapeng Zhu, Zichen Ding, Xiang Li

PDF

Open Access

TL;DR

GRACE is a reinforcement learning framework that improves retrieval-augmented language models by ensuring grounded responses and reliable abstention, reducing annotation costs and enhancing accuracy.

Contribution

It introduces a unified reinforcement learning approach with a novel data construction and reward system to address grounding and abstention issues simultaneously.

Findings

01

Achieves state-of-the-art accuracy on benchmarks.

02

Balances response correctness and abstention effectively.

03

Reduces annotation costs by 90%.

Abstract

Retrieval-Augmented Generation (RAG) integrates external knowledge to enhance Large Language Models (LLMs), yet systems remain susceptible to two critical flaws: providing correct answers without explicit grounded evidence and producing fabricated responses when the retrieved context is insufficient. While prior research has addressed these issues independently, a unified framework that integrates evidence-based grounding and reliable abstention is currently lacking. In this paper, we propose GRACE, a reinforcement-learning framework that simultaneously mitigates both types of flaws. GRACE employs a data construction method that utilizes heterogeneous retrievers to generate diverse training samples without manual annotation. A multi-stage gated reward function is then employed to train the model to assess evidence sufficiency, extract key supporting evidence, and provide answers or…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques