Mathematical Reasoning via Intervention-Based Time-Series Causal Discovery Using LLMs as Concept Mastery Simulators
Tsuyoshi Okita

TL;DR
This paper introduces CIKA, a framework that uses LLMs as interventional simulators to identify causally relevant concepts in mathematical reasoning, improving interpretability and performance.
Contribution
It formalizes the Interventional Capability Probe (ICP) to distinguish causally relevant concepts and demonstrates its effectiveness in enhancing LLM mathematical reasoning.
Findings
ICP significantly discriminates causally relevant concepts from irrelevant ones.
Solved problems have 6.1× higher ICP effect than unsolved ones.
CIKA improves LLM performance on multiple math benchmarks.
Abstract
Recent methods for improving LLM mathematical reasoning, whether through MCTS-based test-time search or causal graph-guided knowledge injection, cannot identify which concepts causally contribute to a correct answer, as the observed association may be spurious, driven by confounders such as problem difficulty. We propose CIKA (Causal Intervention for Knowledge Activation), a framework that uses the LLM itself as an interventional simulator: a prompt sets the concept state to ``mastered'' and the correctness change estimates the causal effect. We formalize this quantity as an Interventional Capability Probe (ICP), which diagnoses whether the LLM can use a given concept -- distinct from merely possessing knowledge. Because the intervention exogenously sets the concept state independently of problem difficulty, ICP separates confounding that observational methods cannot. On 67 screened…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
