EDIS: Diagnosing LLM Reasoning via Entropy Dynamics

Chenghua Zhu; Siyan Wu; Xiangkang Zeng; Zishan Xu; Zhaolu Kang; Yifu Guo; Yuquan Lu; Junduan Huang; Guojing Zhou

arXiv:2602.01288·cs.LG·March 9, 2026

EDIS: Diagnosing LLM Reasoning via Entropy Dynamics

Chenghua Zhu, Siyan Wu, Xiangkang Zeng, Zishan Xu, Zhaolu Kang, Yifu Guo, Yuquan Lu, Junduan Huang, Guojing Zhou

PDF

Open Access

TL;DR

This paper introduces EDIS, a new metric based on the temporal evolution of entropy during LLM reasoning, which effectively diagnoses reasoning errors and improves inference accuracy.

Contribution

The paper demonstrates that analyzing entropy trajectories provides richer insights into LLM reasoning failures and introduces EDIS, a novel instability score for better diagnosis and selection.

Findings

01

Entropy dynamics reveal patterns distinguishing correct and incorrect reasoning.

02

EDIS improves inference-time selection accuracy.

03

Entropy trajectories are consistent across models and training stages.

Abstract

Entropy-based confidence signals are increasingly leveraged to improve reasoning in large language models (LLMs), yet existing approaches treat confidence as a static quantity -- typically aggregated over tokens. We show that the \emph{temporal evolution} of confidence during generation carries richer information than aggregate statistics alone. Analyzing token-level entropy trajectories, we identify characteristic patterns distinguishing correct from incorrect reasoning: erroneous solutions exhibit unstable dynamics, including burst spikes (sustained uncertainty growth) and peak-valley spikes (sharp rebounds following transient confidence). These patterns persist across models and training stages, suggesting they reflect intrinsic properties of reasoning failure rather than superficial noise. To formalize this observation, we introduce the Entropy Dynamics Instability Score…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Text Readability and Simplification