Improving reasoning at inference time via uncertainty minimisation
Nicolas Legrand, Kenneth Enevoldsen, M\'arton Kardos, Kristoffer Nielbo

TL;DR
This paper introduces a novel inference-time method for reasoning in large language models that maximizes internal self-certainty at each step, improving accuracy with fewer samples and better transferability across languages.
Contribution
The proposed approach frames reasoning as uncertainty minimization at the thought level, operating solely on internal model signals, and demonstrates superior performance over existing methods.
Findings
Outperforms greedy decoding and matches self-consistency with fewer samples
Transfers effectively across multiple languages and model sizes
Early reasoning decisions are predictive of final accuracy
Abstract
Large language models (LLMs) now exhibit strong multi-step reasoning abilities, but existing inference-time scaling methods remain computationally expensive, often relying on extensive sampling or external evaluators. We propose a principled strategy that frames reasoning as uncertainty minimisation and operates at the level of individual thoughts rather than tokens. Our method selects, at each reasoning step, the continuation that maximizes the model's self-certainty, a metric computed from its internal predictive distribution. This approach achieves significant improvement with a small number of samples, relies exclusively on model-internal signals, and applies to open-ended questions as opposed to methods like majority voting. Experiments on MATH500 and GSM8K across multiple model sizes demonstrate that thought-level self-certainty maximization consistently outperforms greedy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI)
