CAMA: Enhancing Mathematical Reasoning in Large Language Models with Causal Knowledge
Lei Zan, Keli Zhang, Ruichu Cai, Lujia Pan

TL;DR
CAMA introduces a causal graph-based framework to improve large language models' mathematical reasoning by explicitly modeling and utilizing causal structures, leading to significant performance enhancements on complex problems.
Contribution
This work presents a two-stage causal framework that constructs and refines a mathematical causal graph, then uses that graph to guide LLM reasoning on mathematical problems.
Findings
CAMA significantly improves LLM performance on mathematical tasks.
Structured causal guidance outperforms unstructured methods.
Asymmetric causal relationships yield greater improvements.
Abstract
Large Language Models (LLMs) have demonstrated strong performance across a wide range of tasks, yet they still struggle with complex mathematical reasoning, a challenge fundamentally rooted in deep structural dependencies. To address this challenge, we propose CAusal MAthematician (CAMA), a two-stage causal framework that equips LLMs with explicit, reusable mathematical structure. In the learning stage, CAMA first constructs the Mathematical Causal Graph (MCG), a high-level representation of solution strategies, by combining LLM priors with causal discovery algorithms applied to a corpus of question-solution pairs. The resulting MCG encodes essential knowledge points and their causal dependencies. To better align the graph with downstream reasoning tasks, CAMA further refines the MCG through iterative feedback derived from a…
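To make the idea of a graph of knowledge points with directed causal dependencies more concrete, here is a minimal sketch of such a data structure in Python. It is not the authors' implementation: the knowledge-point names, the edges, and the helper functions `build_graph`, `ancestors`, and `reasoning_order` are all hypothetical, and the edges are written by hand rather than obtained from LLM priors or a causal discovery algorithm. The sketch only illustrates how a directed graph can record which knowledge points precede others and how a dependency-respecting order can be read off it.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical knowledge points and edges, invented for illustration only.
# An edge (cause, effect) is directed: it does not imply (effect, cause).
EDGES = [
    ("fractions", "ratios"),
    ("ratios", "proportions"),
    ("linear_equations", "proportions"),
    ("proportions", "percent_change"),
]


def build_graph(edges):
    """Map each knowledge point to the set of its direct causes (parents)."""
    parents = {}
    for cause, effect in edges:
        parents.setdefault(cause, set())
        parents.setdefault(effect, set()).add(cause)
    return parents


def ancestors(parents, node):
    """Return every knowledge point that `node` depends on, directly or not."""
    seen = set()
    stack = [node]
    while stack:
        for parent in parents[stack.pop()]:
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


def reasoning_order(parents, target):
    """Order the target and its ancestors so that causes precede effects."""
    keep = ancestors(parents, target) | {target}
    subgraph = {node: parents[node] & keep for node in keep}
    return list(TopologicalSorter(subgraph).static_order())


graph = build_graph(EDGES)
order = reasoning_order(graph, "proportions")
```

In this toy graph, `order` lists `fractions`, `ratios`, and `linear_equations` ahead of `proportions` and leaves out `percent_change`, which lies downstream of the target. How such a subgraph would be turned into guidance for an LLM is left open here.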
Taxonomy
Topics: Topic Modeling · Machine Learning in Materials Science · Advanced Graph Neural Networks
