KisMATH: Do LLMs Have Knowledge of Implicit Structures in Mathematical Reasoning?

Soumadeep Saha; Akshay Chaturvedi; Saptarshi Saha; Utpal Garain; Nicholas Asher

arXiv:2507.11408·cs.CL·January 21, 2026

KisMATH: Do LLMs Have Knowledge of Implicit Structures in Mathematical Reasoning?

Soumadeep Saha, Akshay Chaturvedi, Saptarshi Saha, Utpal Garain, Nicholas Asher

PDF

Open Access

TL;DR

This paper introduces CCGraphs, a new method to analyze how large language models perform mathematical reasoning by modeling causal dependencies in their reasoning traces, revealing that models internalize similar structures.

Contribution

The paper presents CCGraphs and the KisMATH dataset, providing empirical evidence that LLMs recognize and utilize causal structures in mathematical reasoning.

Findings

01

Reasoning nodes are causal contributors to answers.

02

LLMs emphasize reasoning paths similar to CCGraphs.

03

CCGraphs enable controlled interventions in reasoning analysis.

Abstract

Chain-of-thought (CoT) traces have been shown to improve performance of large language models on a plethora of reasoning tasks, yet there is no consensus on the mechanism by which this boost is achieved. To shed more light on this, we introduce Causal CoT Graphs (CCGraphs), which are directed acyclic graphs automatically extracted from reasoning traces that model fine-grained causal dependencies in language-model outputs. A collection of 1671 mathematical reasoning problems from MATH500, GSM8K, and AIME, together with their associated CCGraphs, has been compiled into our dataset -- KisMATH. Our detailed empirical analysis with 15 open-weight LLMs shows that (i) reasoning nodes in the CCGraphs are causal contributors to the final answer, which we argue is constitutive of reasoning; and (ii) LLMs emphasize the reasoning paths captured by the CCGraphs, indicating that the models internally…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMathematics, Computing, and Information Processing · Statistics Education and Methodologies · Natural Language Processing Techniques