Thinking Traps in Long Chain-of-Thought: A Measurable Study and Trap-Aware Adaptive Restart
Kang Chen, Fan Yu, Junjie Nian, Shihan Zhao, Zhuoka Feng, Zijun Yao, Heng Wang, Minshen Yu, Yixin Cao

TL;DR
This paper identifies reasoning deadlocks called Thinking Traps in Long Chain-of-Thought processes and proposes TAAR, a test-time control method that detects and escapes these traps, improving reasoning accuracy without retraining models.
Contribution
The paper introduces TAAR, a novel trap-aware adaptive restart framework that predicts trap locations and applies targeted interventions during inference to enhance reasoning performance.
Findings
89% of failures on DAPO-MATH involve Thinking Traps
TAAR improves reasoning accuracy on multiple benchmarks
TAAR operates without fine-tuning base models
Abstract
Scaling test-time compute via Long Chain-of-Thought (Long-CoT) significantly enhances reasoning capabilities, yet extended generation does not guarantee correctness: after an early wrong commitment, models may keep elaborating a self-consistent but incorrect prefix. Through fine-grained trajectory analysis, we identify Thinking Traps, prefix-dominant deadlocks where later reflection, alternative attempts, or verification fails to revise the root error. On a curated subset of DAPO-MATH, 89\% of failures exhibit such traps. To solve this problem, we introduce TAAR (Trap-Aware Adaptive Restart), a test-time control framework that trains a diagnostic policy to predict two signals from partial trajectories: a trap index for where to truncate and an escape probability for whether and how strongly to intervene. At inference time, TAAR truncates the trajectory before the predicted trap segment…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Machine Learning and Algorithms · Formal Methods in Verification
