Mitigating Overthinking in Large Reasoning Language Models via Reasoning Path Deviation Monitoring

Weixin Guan; Liang Li; Jiapeng Liu; Bing Li; Peng Fu; Chengyang Fang; Xiaoshuai Hao; Can Ma; Weiping Wang

arXiv:2603.14251·cs.CL·March 17, 2026

Open Access

TL;DR

This paper introduces a novel early-exit method for large reasoning language models that monitors reasoning path deviations via high-entropy tokens to effectively mitigate overthinking, improving performance and efficiency.

Contribution

The proposed method couples early-exit with native reasoning by using a path deviation index based on high-entropy tokens, reducing overthinking without extra training overhead.

Findings

01. Significant performance gains over vanilla Chain-of-Thought methods.

02. Effective detection and termination of overthinking trajectories.

03. Improved efficiency in reasoning tasks across multiple benchmarks.

Abstract

Large Reasoning Language Models (LRLMs) demonstrate impressive capabilities on complex tasks by utilizing long Chain-of-Thought reasoning. However, they are prone to overthinking: they generate redundant reasoning steps that degrade both performance and efficiency. Recently, early-exit strategies have been proposed to mitigate overthinking by adaptively terminating redundant reasoning. However, current early-exit methods either introduce extra training overhead by relying on proxy models or limit inference throughput because of frequent content switching between reasoning and generating probing answers. Moreover, most early-exit methods harm LRLM performance through over-truncation. Our insight stems from an observation: overthinking often causes LRLMs to deviate from the correct reasoning path, and this deviation is frequently accompanied by high-entropy transition tokens. Given this,…
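
The observation above links overthinking to high-entropy tokens. As a rough illustration only, the sketch below computes per-token Shannon entropy from next-token logits and flags a trajectory once a large share of recent tokens is high-entropy. The abstract is cut off before the method is defined, so this is not the paper's path deviation index: the function names, the entropy threshold, the window size, and the exit ratio are all hypothetical choices made for this example.

```python
import math
from typing import Sequence


def token_entropy(logits: Sequence[float]) -> float:
    """Shannon entropy (in nats) of the softmax distribution over `logits`."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    z = sum(exps)
    probs = [e / z for e in exps]
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def high_entropy_ratio(entropies: Sequence[float],
                       threshold: float,
                       window: int) -> float:
    """Fraction of the last `window` tokens whose entropy exceeds `threshold`.

    Hypothetical stand-in for a deviation signal; not the paper's index.
    """
    recent = list(entropies)[-window:]
    if not recent:
        return 0.0
    return sum(1 for h in recent if h > threshold) / len(recent)


def should_exit(entropies: Sequence[float],
                threshold: float = 1.0,   # assumed value, in nats
                window: int = 8,          # assumed value, in tokens
                max_ratio: float = 0.5) -> bool:
    """Return True once a full window is available and it is mostly high-entropy."""
    if len(entropies) < window:
        return False
    return high_entropy_ratio(entropies, threshold, window) >= max_ratio
```

In a decoding loop, one would append `token_entropy(step_logits)` after each generated token and stop the reasoning phase when `should_exit` returns True. Because the check reuses logits the model already produces, it needs no proxy model and no probing answers, which is the general property the abstract attributes to the approach.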

Taxonomy

Topics: Topic Modeling · Advanced Graph Neural Networks · Multimodal Machine Learning Applications