Understanding Chain-of-Thought in Large Language Models via Topological Data Analysis
Chenghao Li, Chaoning Zhang, Yi Lu, Shuxu Chen, Xudong Wang, Jiaquan Zhang, Zhicheng Wang, Zhengxun Jin, Kuien Liu, Sung-Ho Bae, Guoqing Wang, Yang Yang, and Heng Tao Shen

TL;DR
This paper introduces a novel approach using Topological Data Analysis to analyze the structural properties of reasoning chains in large language models, revealing correlations between topology and reasoning accuracy.
Contribution
It is the first to apply persistent homology to study the structural mechanisms of reasoning chains in LLMs, linking topological features to reasoning performance.
Findings
Topological complexity correlates positively with accuracy.
Successful reasoning exhibits simpler topologies with less redundancy.
Topological features can identify logical gaps and improve reasoning efficiency.
Abstract
With the development of large language models (LLMs), particularly with the introduction of the long reasoning chain technique, the reasoning ability of LLMs in complex problem-solving has been significantly enhanced. While acknowledging the power of long reasoning chains, we cannot help but wonder: Why do different reasoning chains perform differently in reasoning? What components of the reasoning chains play a key role? Existing studies mainly focus on evaluating reasoning chains from a functional perspective, with little attention paid to their structural mechanisms. To address this gap, this work is the first to analyze and evaluate the quality of the reasoning chain from a structural perspective. We apply persistent homology from Topological Data Analysis (TDA) to map reasoning steps into semantic space, extract topological features, and analyze structural changes. These changes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis · Advanced Graph Neural Networks · Constraint Satisfaction and Optimization
