Demystifying Reasoning Dynamics with Mutual Information: Thinking Tokens are Information Peaks in LLM Reasoning

Chen Qian; Dongrui Liu; Haochen Wen; Zhen Bai; Yong Liu; Jing Shao

arXiv:2506.02867·cs.AI·June 5, 2025

Demystifying Reasoning Dynamics with Mutual Information: Thinking Tokens are Information Peaks in LLM Reasoning

Chen Qian, Dongrui Liu, Haochen Wen, Zhen Bai, Yong Liu, Jing Shao

PDF

Open Access 1 Repo

TL;DR

This paper explores the internal reasoning process of large reasoning models using mutual information, revealing peaks associated with thinking tokens and proposing methods to enhance reasoning performance.

Contribution

It introduces an information-theoretic analysis of LRM reasoning, identifies the role of thinking tokens, and proposes techniques to leverage these tokens for improved reasoning.

Findings

01

Mutual information peaks correlate with reflection tokens.

02

Thinking tokens significantly impact reasoning accuracy.

03

Proposed methods improve LRM reasoning performance.

Abstract

Large reasoning models (LRMs) have demonstrated impressive capabilities in complex problem-solving, yet their internal reasoning mechanisms remain poorly understood. In this paper, we investigate the reasoning trajectories of LRMs from an information-theoretic perspective. By tracking how mutual information (MI) between intermediate representations and the correct answer evolves during LRM reasoning, we observe an interesting MI peaks phenomenon: the MI at specific generative steps exhibits a sudden and significant increase during LRM's reasoning process. We theoretically analyze such phenomenon and show that as MI increases, the probability of model's prediction error decreases. Furthermore, these MI peaks often correspond to tokens expressing reflection or transition, such as ``Hmm'', ``Wait'' and ``Therefore,'' which we term as the thinking tokens. We then demonstrate that these…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chnq/mi-peaks
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMulti-Agent Systems and Negotiation · Business Process Modeling and Analysis · Semantic Web and Ontologies