Partial Reasoning in Language Models: Search and Refinement Guided by Uncertainty

Murilo da Luz; Bruno Brand\~ao; Luana Martins; Gustavo Oliveira; Bryan de Oliveira; Luckeciano Melo; Telma Soares

arXiv:2601.12040·cs.AI·January 21, 2026

Partial Reasoning in Language Models: Search and Refinement Guided by Uncertainty

Murilo da Luz, Bruno Brand\~ao, Luana Martins, Gustavo Oliveira, Bryan de Oliveira, Luckeciano Melo, Telma Soares

PDF

Open Access 1 Video

TL;DR

This paper presents PREGU, a method that improves reasoning in large language models by monitoring output entropy to trigger localized search and refinement, enhancing multi-step inference accuracy.

Contribution

Introducing PREGU, a novel approach that uses entropy-based uncertainty monitoring to guide partial reasoning and refinement in language models.

Findings

01

PREGU outperforms baseline methods on multiple reasoning benchmarks.

02

Entropy effectively signals when to refine reasoning steps.

03

Localized search improves multi-step inference accuracy.

Abstract

The use of Large Language Models (LLMs) for reasoning and planning tasks has drawn increasing attention in Artificial Intelligence research. Despite their remarkable progress, these models still exhibit limitations in multi-step inference scenarios, particularly in mathematical and logical reasoning. We introduce PREGU (Partial Reasoning Guided by Uncertainty). PREGU monitors the entropy of the output distribution during autoregressive generation and halts the process whenever entropy exceeds a defined threshold, signaling uncertainty. From that point, a localized search is performed in the latent space to refine the partial reasoning and select the most coherent answer, using the Soft Reasoning method. Experiments conducted with LLaMA-3-8B, Mistral-7B, and Qwen2-7B across four reasoning benchmarks (GSM8K, GSM-Hard, SVAMP, and StrategyQA) showed performance greater than or similar to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Partial Reasoning in Language Models Search and Refinement Guided by Uncertainty· underline

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · AI-based Problem Solving and Planning