AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Junyu Zhang; Runpei Dong; Han Wang; Xuying Ning; Haoran Geng; Peihao Li; Xialin He; Yutong Bai; Jitendra Malik; Saurabh Gupta; Huan Zhang

arXiv:2505.24863·cs.CL·June 2, 2025

AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Junyu Zhang, Runpei Dong, Han Wang, Xuying Ning, Haoran Geng, Peihao Li, Xialin He, Yutong Bai, Jitendra Malik, Saurabh Gupta, Huan Zhang

PDF

Open Access 1 Video

TL;DR

AlphaOne introduces a universal framework that dynamically balances slow and fast reasoning in large models at test time, improving reasoning accuracy and efficiency across diverse tasks.

Contribution

It proposes a novel scalable reasoning modulation method using the alpha moment and stochastic scheduling, unifying and extending existing approaches.

Findings

01

Outperforms existing methods on mathematical benchmarks

02

Enhances reasoning efficiency without sacrificing accuracy

03

Demonstrates versatility across multiple scientific domains

Abstract

This paper presents AlphaOne ( $α$ 1), a universal framework for modulating reasoning progress in large reasoning models (LRMs) at test time. $α$ 1 first introduces $α$ moment, which represents the scaled thinking phase with a universal parameter $α$ . Within this scaled pre- $α$ moment phase, it dynamically schedules slow thinking transitions by modeling the insertion of reasoning transition tokens as a Bernoulli stochastic process. After the $α$ moment, $α$ 1 deterministically terminates slow thinking with the end-of-thinking token, thereby fostering fast reasoning and efficient answer generation. This approach unifies and generalizes existing monotonic scaling methods by enabling flexible and dense slow-to-fast reasoning modulation. Extensive empirical studies on various challenging benchmarks across mathematical, coding, and scientific domains…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time· underline

Taxonomy

TopicsSemantic Web and Ontologies · Software Testing and Debugging Techniques