EntroCoT: Enhancing Chain-of-Thought via Adaptive Entropy-Guided Segmentation

Zihang Li; Yuhang Wang; Yikun Zong; Wenhan Yu; Xiaokun Yuan; Runhan Jiang; Zirui Liu; Tong Yang; Arthur Jiang

arXiv:2601.03769·cs.AI·January 13, 2026

EntroCoT: Enhancing Chain-of-Thought via Adaptive Entropy-Guided Segmentation

Zihang Li, Yuhang Wang, Yikun Zong, Wenhan Yu, Xiaokun Yuan, Runhan Jiang, Zirui Liu, Tong Yang, Arthur Jiang

PDF

Open Access

TL;DR

EntroCoT introduces an entropy-guided segmentation and evaluation framework to improve the quality of reasoning traces in Large Language Models, leading to better mathematical reasoning performance.

Contribution

The paper presents a novel entropy-based segmentation and Monte Carlo evaluation method to refine reasoning traces, reducing hallucinations and improving dataset quality.

Findings

01

Outperforms baselines on mathematical benchmarks

02

Constructs higher-quality reasoning datasets

03

Enhances reasoning accuracy through refined supervision

Abstract

Chain-of-Thought (CoT) prompting has significantly enhanced the mathematical reasoning capabilities of Large Language Models. We find existing fine-tuning datasets frequently suffer from the "answer right but reasoning wrong" probelm, where correct final answers are derived from hallucinated, redundant, or logically invalid intermediate steps. This paper proposes EntroCoT, a unified framework for automatically identifying and refining low-quality CoT supervision traces. EntroCoT first proposes an entropy-based mechanism to segment the reasoning trace into multiple steps at uncertain junctures, and then introduces a Monte Carlo rollout-based mechanism to evaluate the marginal contribution of each step. By accurately filtering deceptive reasoning samples, EntroCoT constructs a high-quality dataset where every intermediate step in each reasoning trace facilitates the final answer.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Advanced Graph Neural Networks · Machine Learning in Healthcare