Let LRMs Break Free from Overthinking via Self-Braking Tuning

Haoran Zhao; Yuchen Yan; Yongliang Shen; Haolei Xu; Wenqi Zhang; Kaitao Song; Jian Shao; Weiming Lu; Jun Xiao; Yueting Zhuang

arXiv:2505.14604·cs.CL·October 31, 2025

Let LRMs Break Free from Overthinking via Self-Braking Tuning

Haoran Zhao, Yuchen Yan, Yongliang Shen, Haolei Xu, Wenqi Zhang, Kaitao Song, Jian Shao, Weiming Lu, Jun Xiao, Yueting Zhuang

PDF

Open Access 2 Repos

TL;DR

This paper introduces Self-Braking Tuning, a novel method enabling large reasoning models to self-regulate their reasoning length, reducing redundant computation and overthinking without external intervention, while maintaining high accuracy.

Contribution

The paper proposes a self-regulation framework for LRMs that detects overthinking and learns to terminate reasoning adaptively, improving efficiency without sacrificing performance.

Findings

01

Reduces token consumption by up to 60%.

02

Maintains comparable accuracy to unconstrained models.

03

Effective across multiple mathematical benchmarks.

Abstract

Large reasoning models (LRMs), such as OpenAI o1 and DeepSeek-R1, have significantly enhanced their reasoning capabilities by generating longer chains of thought, demonstrating outstanding performance across a variety of tasks. However, this performance gain comes at the cost of a substantial increase in redundant reasoning during the generation process, leading to high computational overhead and exacerbating the issue of overthinking. Although numerous existing approaches aim to address the problem of overthinking, they often rely on external interventions. In this paper, we propose a novel framework, Self-Braking Tuning (SBT), which tackles overthinking from the perspective of allowing the model to regulate its own reasoning process, thus eliminating the reliance on external control mechanisms. We construct a set of overthinking identification metrics based on standard answers and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Rights Management and Security

MethodsSparse Evolutionary Training