Not All Errors Are Created Equal: ASCoT Addresses Late-Stage Fragility in Efficient LLM Reasoning

Dongxu Zhang; Yujun Wu; Yiding Sun; Jinnan Yang; Ning Yang; Jihua Zhu; Miao Xin; Baoliang Tian

arXiv:2508.05282·cs.CL·March 4, 2026

Not All Errors Are Created Equal: ASCoT Addresses Late-Stage Fragility in Efficient LLM Reasoning

Dongxu Zhang, Yujun Wu, Yiding Sun, Jinnan Yang, Ning Yang, Jihua Zhu, Miao Xin, Baoliang Tian

PDF

TL;DR

This paper identifies late-stage errors as more damaging in LLM reasoning and introduces ASCoT, a method that improves efficiency and robustness by focusing verification on critical late reasoning steps, reducing resource use with minimal accuracy loss.

Contribution

The paper proposes ASCoT, a novel approach combining semantic pruning and adaptive verification to address late-stage fragility in LLM reasoning, enhancing efficiency and reliability.

Findings

01

Reduces token usage by 21-30% on GSM8K and MATH-500

02

Maintains accuracy with less than 1.8% drop

03

Effectively reallocates computational resources to critical reasoning steps

Abstract

While Chain-of-Thought (CoT) prompting empowers Large Language Models (LLMs), ensuring reasoning reliability remains an open challenge. Contrary to the prevailing cascading failure hypothesis which posits that early errors are most detrimental, we identify a counter-intuitive phenomenon termed \textbf{Late-Stage Fragility}: errors introduced in later reasoning stages are significantly more prone to corrupting final answers. To address this, we introduce ASCoT (Adaptive Self-Correction Chain-of-Thought), a method harmonizing efficiency with robust verification. ASCoT first employs semantic pruning to compress redundant steps, then utilizes an Adaptive Verification Manager (AVM) to prioritize high risk, late-stage steps via a positional impact score, triggering a Multi-Perspective Self-Correction Engine (MSCE) only when necessary. Experiments on GSM8K and MATH-500 demonstrate that ASCoT…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.