SELFDOUBT: Uncertainty Quantification for Reasoning LLMs via the Hedge-to-Verify Ratio
Satwik Pandey, Suresh Raghu, Shashwat Pandey

TL;DR
SELFDOUBT introduces a single-pass uncertainty estimation method for reasoning language models that relies on behavioral signals from reasoning traces, enabling efficient deployment even with proprietary APIs.
Contribution
It proposes the Hedge-to-Verify Ratio (HVR), a novel signal extracted from reasoning traces, to assess uncertainty without multiple samples or model internals.
Findings
Traces without hedging markers are 96% correct.
SELFDOUBT outperforms semantic entropy at 10x lower cost.
Achieves 90% accuracy at 71% coverage in deployment cascade.
Abstract
Uncertainty estimation for reasoning language models remains difficult to deploy in practice: sampling-based methods are computationally expensive, while common single-pass proxies such as verbalized confidence or trace length are often inconsistent across models. This problem is compounded for proprietary reasoning APIs that expose neither logits nor intermediate token probabilities, leaving practitioners with no reliable uncertainty signal at inference time. We propose SELFDOUBT, a single-pass uncertainty framework that resolves this impasse by extracting behavioral signals directly from the reasoning trace itself. Our key signal, the Hedge-to-Verify Ratio (HVR), detects whether a reasoning trace contains uncertainty markers and, if so, whether they are offset by explicit selfchecking behavior. Unlike methods that require multiple sampled traces or model internals, SELFDOUBT operates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
