Quantifying and Understanding Uncertainty in Large Reasoning Models

Yangyi Li; Chenxu Zhao; and Mengdi Huai

arXiv:2604.13395·cs.AI·April 16, 2026

Quantifying and Understanding Uncertainty in Large Reasoning Models

Yangyi Li, Chenxu Zhao, and Mengdi Huai

PDF

TL;DR

This paper introduces a new method to quantify uncertainty in large reasoning models with statistical guarantees, addressing logical and training factors affecting reasoning and answer correctness.

Contribution

It proposes a novel uncertainty quantification approach for reasoning-answer structures and a Shapley value-based explanation framework with theoretical guarantees.

Findings

01

The method provides statistically rigorous uncertainty sets for reasoning models.

02

The explanation framework identifies key training examples and reasoning steps to preserve guarantees.

03

Experiments demonstrate the effectiveness of the proposed methods on challenging datasets.

Abstract

Large Reasoning Models (LRMs) have recently demonstrated significant improvements in complex reasoning. While quantifying generation uncertainty in LRMs is crucial, traditional methods are often insufficient because they do not provide finite-sample guarantees for reasoning-answer generation. Conformal prediction (CP) stands out as a distribution-free and model-agnostic methodology that constructs statistically rigorous uncertainty sets. However, existing CP methods ignore the logical connection between the reasoning trace and the final answer. Additionally, prior studies fail to interpret the origins of uncertainty coverage for LRMs as they typically overlook the specific training factors driving valid reasoning. Notably, it is challenging to disentangle reasoning quality from answer correctness when quantifying uncertainty, while simultaneously establishing theoretical guarantees for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.