Mitigating Overthinking in Large Reasoning Models via Manifold Steering

Yao Huang; Huanran Chen; Shouwei Ruan; Yichi Zhang; Xingxing Wei; Yinpeng Dong

arXiv:2505.22411·cs.LG·November 18, 2025

Mitigating Overthinking in Large Reasoning Models via Manifold Steering

Yao Huang, Huanran Chen, Shouwei Ruan, Yichi Zhang, Xingxing Wei, Yinpeng Dong

PDF

Open Access

TL;DR

This paper introduces Manifold Steering, a novel technique to reduce overthinking in large reasoning models by projecting activation interventions onto a low-dimensional manifold, significantly decreasing computational overhead while maintaining accuracy.

Contribution

The paper reveals that overthinking is linked to a low-dimensional manifold in activation space and proposes a manifold projection method to effectively mitigate it.

Findings

01

Reduces output tokens by up to 71% on mathematical benchmarks

02

Maintains and improves accuracy across multiple tasks

03

Demonstrates robust transferability across domains

Abstract

Recent advances in Large Reasoning Models (LRMs) have demonstrated remarkable capabilities in solving complex tasks such as mathematics and coding. However, these models frequently exhibit a phenomenon known as overthinking during inference, characterized by excessive validation loops and redundant deliberation, leading to substantial computational overheads. In this paper, we aim to mitigate overthinking by investigating the underlying mechanisms from the perspective of mechanistic interpretability. We first showcase that the tendency of overthinking can be effectively captured by a single direction in the model's activation space and the issue can be eased by intervening the activations along this direction. However, this efficacy soon reaches a plateau and even deteriorates as the intervention strength increases. We therefore systematically explore the activation space and find that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Advanced Graph Neural Networks · Topic Modeling