Adaptive Deep Reasoning: Triggering Deep Thinking When Needed

Yunhao Wang; Yuhao Zhang; Tinghao Yu; Can Xu; Feng Zhang; Fengzong Lian

arXiv:2505.20101·cs.CL·May 28, 2025

Adaptive Deep Reasoning: Triggering Deep Thinking When Needed

Yunhao Wang, Yuhao Zhang, Tinghao Yu, Can Xu, Feng Zhang, Fengzong Lian

PDF

Open Access

TL;DR

This paper introduces an adaptive reasoning approach for large language models that dynamically switches between short and long reasoning chains based on problem complexity, improving efficiency without sacrificing accuracy.

Contribution

It proposes a novel reinforcement learning framework with adaptive reward strategies and reasoning mode switching loss to enable autonomous reasoning mode selection.

Findings

01

Effective dynamic switching between reasoning modes demonstrated on mathematical datasets.

02

Maintains high accuracy while reducing computational costs.

03

Enhances reasoning efficiency for real-world applications.

Abstract

Large language models (LLMs) have shown impressive capabilities in handling complex tasks through long-chain reasoning. However, the extensive reasoning steps involved can significantly increase computational costs, posing challenges for real-world deployment. Recent efforts have focused on optimizing reasoning efficiency by shortening the Chain-of-Thought (CoT) reasoning processes through various approaches, such as length-aware prompt engineering, supervised fine-tuning on CoT data with variable lengths, and reinforcement learning with length penalties. Although these methods effectively reduce reasoning length, they still necessitate an initial reasoning phase. More recent approaches have attempted to integrate long-chain and short-chain reasoning abilities into a single model, yet they still rely on manual control to toggle between short and long CoT. In this work, we propose a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning

MethodsBalanced Selection