TERMINATOR: Learning Optimal Exit Points for Early Stopping in Chain-of-Thought Reasoning

Alliot Nagle; Jakhongir Saydaliev; Dhia Garbaya; Michael Gastpar; Ashok Vardhan Makkuva; Hyeji Kim

arXiv:2603.12529·cs.LG·May 15, 2026

TERMINATOR: Learning Optimal Exit Points for Early Stopping in Chain-of-Thought Reasoning

Alliot Nagle, Jakhongir Saydaliev, Dhia Garbaya, Michael Gastpar, Ashok Vardhan Makkuva, Hyeji Kim

PDF

2 Models

TL;DR

This paper introduces Terminator, a strategy for early stopping in large reasoning models that reduces reasoning length and inference time without sacrificing accuracy by predicting optimal exit points.

Contribution

The paper proposes a novel early-exit method trained on a new dataset to effectively determine when to stop reasoning in large models, reducing overthinking and computation.

Findings

01

Achieves 14%-55% reduction in reasoning length across four datasets.

02

Reduces inference latency by more than 2x compared to baseline models.

03

Outperforms current state-of-the-art early stopping methods.

Abstract

Large Reasoning Models (LRMs) achieve impressive performance on complex reasoning tasks via Chain-of-Thought (CoT) reasoning, which enables them to generate intermediate thinking tokens before arriving at the final answer. However, LRMs often suffer from significant overthinking, spending excessive compute time even after the answer is generated early on. Prior work has identified the existence of an optimal reasoning length such that truncating reasoning at this point significantly shortens CoT outputs with virtually no change in performance. However, determining optimal CoT lengths for practical datasets is highly non-trivial as they are fully task and model-dependent. In this paper, we precisely address this and design Terminator, an early-exit strategy for LRMs at inference to mitigate overthinking. The central idea underpinning Terminator is that the first arrival of an LRM's final…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.