Only Say What You Know: Calibration-Aware Generation for Long-Form Factuality

Wen Luo; Guangyue Peng; Liang Wang; Nan Yang; Wei Li; Yuhan Song; Shaohang Wei; Feifan Song; Furu Wei; Houfeng Wang

arXiv:2605.01749·cs.CL·May 5, 2026

Only Say What You Know: Calibration-Aware Generation for Long-Form Factuality

Wen Luo, Guangyue Peng, Liang Wang, Nan Yang, Wei Li, Yuhan Song, Shaohang Wei, Feifan Song, Furu Wei, Houfeng Wang

PDF

TL;DR

This paper introduces a new paradigm for long-form generation called Exploration-Commitment Decoupling, enabling models to explore information with awareness and generate more factual, reliable content.

Contribution

It proposes the Calibration-Aware Generation framework that decouples exploration from final commitment, improving factuality and efficiency in long-form generation.

Findings

01

CAG improves factuality by up to 13% across benchmarks.

02

CAG reduces decoding time by up to 37%.

03

Decoupling exploration and commitment enhances reliability in generation.

Abstract

Large Reasoning Models achieve strong performance on complex tasks but remain prone to hallucinations, particularly in long-form generation where errors compound across reasoning steps. Existing approaches to improving factuality, including abstention and factuality-driven optimization, follow a \emph{coupled exploration-commitment} paradigm, in which intermediate reasoning is unconditionally propagated to the final output, limiting fine-grained control over information selection and integration. In this paper, we propose an \textbf{Exploration-Commitment Decoupling} paradigm that disentangles knowledge exploration from final commitment, enabling models to explore with awareness while answering cautiously. We instantiate the paradigm with \textbf{Calibration-Aware Generation (CAG)}, a framework that equips models with end-to-end, calibration-aware generation capabilities, by augmenting…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.