Modeling Hierarchical Thinking in Large Reasoning Models
G M Shahariar, Ali Nazari, Erfan Shayegani, Nael Abu-Ghazaleh

TL;DR
This paper introduces a finite state machine framework to interpret and analyze the hierarchical reasoning processes of large language models during chain-of-thought generation, revealing distinct reasoning patterns.
Contribution
It proposes a structured FSM approach to interpret LLM reasoning, enabling visualization and comparison of reasoning states and trajectories.
Findings
Identifies key reasoning states such as deduction and backtracking.
Provides a method to annotate and analyze reasoning trajectories.
Reveals differences in reasoning patterns across models.
Abstract
Large Language Models (LLMs) have demonstrated remarkable reasoning abilities when they generate step-by-step solutions, known as chain-of-thought (CoT) reasoning. When trained to using chain-of-thought reasoning examples, the resulting models (called Large Reasoning Models, or LRMs) appear to learn hierarchical thinking strategies similar to those used by humans. However, understanding LRMs emerging reasoning capabilities remains a difficult open problem, with many potential important applications including improving training and understanding robustness. In this paper, we adopt a memoryless Finite State Machine formulation to approximate LRM's emerging hierarchical reasoning dynamics as a structured, interpretable abstraction. We identify a small set of discrete reasoning states including - initialization, deduction, augmentation-strategy, uncertainty-estimation, backtracking, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
