Overclocking LLM Reasoning: Monitoring and Controlling Thinking Path Lengths in LLMs
Roy Eisenstadt, Itamar Zimerman, Lior Wolf

TL;DR
This paper investigates how large language models (LLMs) can monitor and control the length of their reasoning processes to improve accuracy and efficiency by manipulating internal progress signals during explicit reasoning.
Contribution
It introduces a method to visualize and manipulate LLM reasoning progress, enabling more concise and accurate thought processes during inference.
Findings
Overclocking reduces unnecessary reasoning steps.
Manipulating progress signals improves answer accuracy.
Method decreases inference latency.
Abstract
Recently, techniques such as explicit structured reasoning have demonstrated strong test-time scaling behavior by enforcing a separation between the model's internal "thinking" process and the final response. A key factor influencing answer quality in this setting is the length of the thinking stage. When the reasoning is too short, the model may fail to capture the complexity of the task. Conversely, when it is too long, the model may overthink, leading to unnecessary computation and degraded performance. This paper explores and exploits the underlying mechanisms by which LLMs understand and regulate the length of their reasoning during explicit thought processes. First, we show that LLMs encode their progress through the reasoning process and introduce an interactive progress bar visualization, which is then used to reveal insights on the model's planning dynamics. Second, we…
Peer Reviews
Decision·Submitted to ICLR 2026
- The paper has some intuitive figures showing the discovered "thinking progress vector".
- The idea of estimating the progress of thinking as proposed in this paper does not make sense. It is beyond me why an LLM can maintain a state representing its reasoning progress if it does not know the correct answer to the problem even when the reasoning is just started. If an estimation of this really existed, then we would have asked this LLM to solve the halting problem, which has been mathematically proven to be impossible. - The TPV might represent some characteristic tokens marking th
1. The idea of identifying the intrinsic mechanisms that encode a model’s relative position within its internal reasoning process is very interesting and useful. 2. The proposal of learning a 'progress vector' is simple yet effective. 3. The extensive expeirments well demonstrate the effectiveness of the proposed solution.
1. The authors only choose mathematical reasoning task for study. It is only a specific subarea of LLM reasoning. More reasoning tasks from other domains should be investigated.
1. The central idea of identifying an internal representation of "progress" and visualizing it as a progress bar is novel and presents an interesting direction for LLM interpretability and human-computer interaction. 2. The authors provide a thorough set of experiments on mathematical reasoning benchmarks, providing a solid empirical grounding for the paper's claims within its chosen domain.
1. Insufficient Evidence for "Monitoring" vs. "Pattern-Matching": The paper's primary claim that the TPV captures the model's monitoring of its own reasoning process, is not sufficiently substantiated. The TPV is trained as a simple regressor mapping hidden states to a normalized token position ($j/N_k$). It is highly probable that this regressor is not learning a high-level, semantic representation of "reasoning progress" but is instead capturing superficial, correlational text patterns. For ex
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLogic, Reasoning, and Knowledge · AI-based Problem Solving and Planning · Topic Modeling
