Improving reasoning at inference time via uncertainty minimisation

Nicolas Legrand; Kenneth Enevoldsen; M\'arton Kardos; Kristoffer Nielbo

arXiv:2603.07159·cs.AI·March 10, 2026

Improving reasoning at inference time via uncertainty minimisation

Nicolas Legrand, Kenneth Enevoldsen, M\'arton Kardos, Kristoffer Nielbo

PDF

Open Access

TL;DR

This paper introduces a novel inference-time method for reasoning in large language models that maximizes internal self-certainty at each step, improving accuracy with fewer samples and better transferability across languages.

Contribution

The proposed approach frames reasoning as uncertainty minimization at the thought level, operating solely on internal model signals, and demonstrates superior performance over existing methods.

Findings

01

Outperforms greedy decoding and matches self-consistency with fewer samples

02

Transfers effectively across multiple languages and model sizes

03

Early reasoning decisions are predictive of final accuracy

Abstract

Large language models (LLMs) now exhibit strong multi-step reasoning abilities, but existing inference-time scaling methods remain computationally expensive, often relying on extensive sampling or external evaluators. We propose a principled strategy that frames reasoning as uncertainty minimisation and operates at the level of individual thoughts rather than tokens. Our method selects, at each reasoning step, the continuation that maximizes the model's self-certainty, a metric computed from its internal predictive distribution. This approach achieves significant improvement with a small number of samples, relies exclusively on model-internal signals, and applies to open-ended questions as opposed to methods like majority voting. Experiments on MATH500 and GSM8K across multiple model sizes demonstrate that thought-level self-certainty maximization consistently outperforms greedy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI)