AdaptThink: Reasoning Models Can Learn When to Think
Jiajie Zhang, Nianyi Lin, Lei Hou, Ling Feng, Juanzi Li

TL;DR
AdaptThink introduces a reinforcement learning approach that enables reasoning models to adaptively select between thinking and skipping thinking, significantly reducing inference costs while improving accuracy on math tasks.
Contribution
This work presents a novel RL algorithm that teaches models to choose optimal thinking modes based on problem difficulty, balancing efficiency and performance.
Findings
Reduces inference response length by 53% on average.
Improves accuracy by 2.4% on three math datasets.
Enhances efficiency without sacrificing reasoning quality.
Abstract
Recently, large reasoning models have achieved impressive performance on various tasks by employing human-like deep thinking. However, the lengthy thinking process substantially increases inference overhead, making efficiency a critical bottleneck. In this work, we first demonstrate that NoThinking, which prompts the reasoning model to skip thinking and directly generate the final solution, is a better choice for relatively simple tasks in terms of both performance and efficiency. Motivated by this, we propose AdaptThink, a novel RL algorithm to teach reasoning models to choose the optimal thinking mode adaptively based on problem difficulty. Specifically, AdaptThink features two core components: (1) a constrained optimization objective that encourages the model to choose NoThinking while maintaining the overall performance; (2) an importance sampling strategy that balances Thinking and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗THU-KEG/AdaptThink-1.5B-delta0.05model· 5 dl5 dl
- 🤗THU-KEG/AdaptThink-7B-delta0.05model· 8 dl· ♡ 18 dl♡ 1
- 🤗THU-KEG/AdaptThink-1.5B-delta0model· 3 dl3 dl
- 🤗THU-KEG/AdaptThink-1.5B-delta0.01model· 2 dl· ♡ 12 dl♡ 1
- 🤗THU-KEG/AdaptThink-1.5B-delta0.02model· 5 dl5 dl
- 🤗THU-KEG/AdaptThink-1.5B-delta0.075model· 1 dl1 dl
- 🤗THU-KEG/AdaptThink-1.5B-delta0.1model· 9 dl· ♡ 29 dl♡ 2
Videos
Taxonomy
TopicsMultimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI) · Topic Modeling
