Controlling Thinking Speed in Reasoning Models

Zhengkai Lin; Zhihang Fu; Ze Chen; Chao Chen; Liang Xie; Wenxiao Wang; Deng Cai; Zheng Wang; Jieping Ye

arXiv:2507.03704·cs.CL·October 31, 2025

Controlling Thinking Speed in Reasoning Models

Zhengkai Lin, Zhihang Fu, Ze Chen, Chao Chen, Liang Xie, Wenxiao Wang, Deng Cai, Zheng Wang, Jieping Ye

PDF

Open Access 1 Video

TL;DR

This paper introduces a method for Large Reasoning Models to dynamically adjust their thinking speed, balancing accuracy and efficiency by controlling internal representations and estimating problem difficulty in real-time.

Contribution

It presents the first approach to control reasoning speed in LRMs through representation editing and real-time difficulty estimation, improving performance without additional training.

Findings

01

Achieves +1.3% accuracy on reasoning benchmarks

02

Reduces token usage by 8.6% on average

03

Outperforms existing prompt-based scaling methods

Abstract

Human cognition is theorized to operate in two modes: fast, intuitive System 1 thinking and slow, deliberate System 2 thinking. While current Large Reasoning Models (LRMs) excel at System 2 thinking, their inability to perform fast thinking leads to high computational overhead and latency. In this work, we enable LRMs to approximate human intelligence through dynamic thinking speed adjustment, optimizing accuracy-efficiency trade-offs. Our approach addresses two key questions: (1) how to control thinking speed in LRMs, and (2) when to adjust it for optimal performance. For the first question, we identify the steering vector that governs slow-fast thinking transitions in LRMs' representation space. Using this vector, we achieve the first representation editing-based test-time scaling effect, outperforming existing prompt-based scaling methods. For the second question, we apply real-time…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Controlling Thinking Speed in Reasoning Models· slideslive

Taxonomy

TopicsBayesian Modeling and Causal Inference · AI-based Problem Solving and Planning · Constraint Satisfaction and Optimization