Cognitive Loop of Thought: Reversible Hierarchical Markov Chain for Efficient Mathematical Reasoning
Jia-Chen Zhang, Zheng Zhou, Yu-Jie Xiong

TL;DR
This paper introduces the Cognitive Loop of Thought (CLoT), a hierarchical reversible Markov chain framework that improves mathematical reasoning in large language models by enabling backward verification and efficient problem decomposition.
Contribution
The paper proposes a novel hierarchical reversible Markov chain model with backward reasoning and pruning, addressing memory and context limitations in multi-step reasoning.
Findings
CLoT achieves 99.0% accuracy on AddSub with GPT-4o-mini.
Outperforms traditional CoT and CoT-SC by 4.1% and 2.9%.
Enhances reasoning robustness and efficiency through hierarchical decomposition.
Abstract
Multi-step Chain-of-Thought (CoT) has significantly advanced the mathematical reasoning capabilities of LLMs by leveraging explicit reasoning steps. However, the widespread adoption of Long CoT often results in sequence lengths that exceed manageable computational limits. While existing approaches attempt to alleviate this by reducing KV Cache redundancy via Markov chain-like structures, they introduce two critical limitations: inherent memorylessness (loss of context) and limited backward reasoning capability. To address these limitations, we propose a novel Chain-of-Thought framework based on Reversible Hierarchical Markov Chain, termed Cognitive Loop of Thought (CLoT), and a backward reasoning dataset CLoT-Instruct. In CLoT, problems are decomposed into sub-problems with hierarchical dependencies. Inspired by human cognitive processes, we introduce a backward verification mechanism…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
