The Stepwise Informativeness Assumption: Why are Entropy Dynamics and Reasoning Correlated in LLMs?

Mar Gonz\`alez I Catal\`a; Haitz S\'aez de Oc\'ariz Borde; George D. Monta\~nez; Pietro Li\`o

arXiv:2604.06192·cs.CL·April 9, 2026

The Stepwise Informativeness Assumption: Why are Entropy Dynamics and Reasoning Correlated in LLMs?

Mar Gonz\`alez I Catal\`a, Haitz S\'aez de Oc\'ariz Borde, George D. Monta\~nez, Pietro Li\`o

PDF

TL;DR

This paper introduces the Stepwise Informativeness Assumption (SIA), explaining why entropy dynamics in LLMs correlate with correctness, supported by theoretical derivation and empirical validation across multiple models and benchmarks.

Contribution

It formalizes SIA as a principle explaining entropy-correctness correlation, deriving observable signatures, and empirically validating it across diverse models and reasoning tasks.

Findings

01

SIA naturally emerges from maximum-likelihood training on reasoning traces.

02

Correct reasoning traces show characteristic entropy patterns.

03

Training induces the SIA in large language models.

Abstract

Recent work uses entropy-based signals at multiple representation levels to study reasoning in large language models, but the field remains largely empirical. A central unresolved puzzle is why internal entropy dynamics, defined under the predictive distribution of a model, correlate so robustly with external correctness given by the ground-truth answer. In this paper, we argue that this correlation arises because autoregressive models reason correctly when they accumulate information about the true answer via answer-informative prefixes. We formalize this intuition via the Stepwise Informativeness Assumption (SIA), which states that reasoning prefixes accumulate answer-relevant information in expectation as generation progresses. We show that SIA naturally emerges from maximum-likelihood optimization on human reasoning traces and is reinforced by standard fine-tuning and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.