AdaptThink: Reasoning Models Can Learn When to Think

Jiajie Zhang; Nianyi Lin; Lei Hou; Ling Feng; Juanzi Li

arXiv:2505.13417·cs.CL·May 20, 2025

AdaptThink: Reasoning Models Can Learn When to Think

Jiajie Zhang, Nianyi Lin, Lei Hou, Ling Feng, Juanzi Li

PDF

Open Access 1 Repo 7 Models 1 Video

TL;DR

AdaptThink introduces a reinforcement learning approach that enables reasoning models to adaptively select between thinking and skipping thinking, significantly reducing inference costs while improving accuracy on math tasks.

Contribution

This work presents a novel RL algorithm that teaches models to choose optimal thinking modes based on problem difficulty, balancing efficiency and performance.

Findings

01

Reduces inference response length by 53% on average.

02

Improves accuracy by 2.4% on three math datasets.

03

Enhances efficiency without sacrificing reasoning quality.

Abstract

Recently, large reasoning models have achieved impressive performance on various tasks by employing human-like deep thinking. However, the lengthy thinking process substantially increases inference overhead, making efficiency a critical bottleneck. In this work, we first demonstrate that NoThinking, which prompts the reasoning model to skip thinking and directly generate the final solution, is a better choice for relatively simple tasks in terms of both performance and efficiency. Motivated by this, we propose AdaptThink, a novel RL algorithm to teach reasoning models to choose the optimal thinking mode adaptively based on problem difficulty. Specifically, AdaptThink features two core components: (1) a constrained optimization objective that encourages the model to choose NoThinking while maintaining the overall performance; (2) an importance sampling strategy that balances Thinking and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

thu-keg/adaptthink
pytorchOfficial

Models

Videos

AdaptThink: Reasoning Models Can Learn When to Think· underline

Taxonomy

TopicsMultimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI) · Topic Modeling