EntroCut: Entropy-Guided Adaptive Truncation for Efficient Chain-of-Thought Reasoning in Small-scale Large Reasoning Models

Hongxi Yan; Qingjie Liu; Yunhong Wang

arXiv:2601.22617·cs.AI·February 2, 2026

TL;DR

EntroCut is a training-free, entropy-guided method that dynamically truncates chain-of-thought reasoning in small-scale large reasoning models, reducing token usage by up to 40% with minimal accuracy loss.

Contribution

We introduce EntroCut, an entropy-guided dynamic truncation technique for efficient reasoning in large reasoning models, together with the Efficiency-Performance Ratio (EPR), a new metric for evaluating efficiency-accuracy trade-offs.

Findings

01 Reduces token usage by up to 40% with minimal accuracy loss

02 Outperforms existing training-free truncation methods

03 Demonstrates practical efficiency improvements on four benchmarks

Abstract

Large Reasoning Models (LRMs) excel at complex reasoning tasks through extended chain-of-thought generation, but their reliance on lengthy intermediate steps incurs substantial computational cost. We find that the entropy of the model's output distribution in early reasoning steps reliably distinguishes correct from incorrect reasoning. Motivated by this observation, we propose EntroCut, a training-free method that dynamically truncates reasoning by identifying high-confidence states where reasoning can be safely terminated. To comprehensively evaluate the trade-off between efficiency and accuracy, we introduce the Efficiency-Performance Ratio (EPR), a unified metric that quantifies relative token savings per unit accuracy loss. Experiments on four benchmarks show that EntroCut reduces token usage by up to 40% with minimal accuracy sacrifice, achieving superior efficiency-performance…
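The two quantities named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the stopping rule or the exact EPR formula, so `should_truncate`, its `threshold` parameter, the `eps` guard, and the specific form of `epr` (relative token savings divided by relative accuracy loss) are all assumptions made for illustration.

```python
import math
from typing import Sequence


def token_entropy(probs: Sequence[float]) -> float:
    """Shannon entropy (in nats) of one next-token probability distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def should_truncate(probs: Sequence[float], threshold: float) -> bool:
    """Hypothetical stopping rule: treat low entropy as a high-confidence state.

    The threshold and the single-step check are assumptions; the abstract
    only says truncation happens at high-confidence states.
    """
    return token_entropy(probs) < threshold


def epr(base_tokens: float, method_tokens: float,
        base_acc: float, method_acc: float, eps: float = 1e-6) -> float:
    """Assumed form of EPR: relative token savings per unit relative accuracy loss.

    eps is an illustrative guard against division by zero when accuracy
    does not drop; it is not taken from the paper.
    """
    token_saving = (base_tokens - method_tokens) / base_tokens
    acc_loss = max((base_acc - method_acc) / base_acc, eps)
    return token_saving / acc_loss


# Illustrative inputs only; these are not results from the paper.
peaked = [0.97, 0.01, 0.01, 0.01]    # confident distribution
uniform = [0.25, 0.25, 0.25, 0.25]   # maximally uncertain distribution
print(should_truncate(peaked, threshold=0.5))
print(should_truncate(uniform, threshold=0.5))
print(epr(base_tokens=10_000, method_tokens=6_000, base_acc=0.80, method_acc=0.78))
```

Under this assumed definition, a larger EPR means more tokens saved for each unit of accuracy given up, which matches the abstract's description of the metric as token savings per unit accuracy loss.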

Taxonomy

Topics: Explainable Artificial Intelligence (XAI) · Machine Learning in Healthcare · Multimodal Machine Learning Applications