CauSim: Scaling Causal Reasoning with Increasingly Complex Causal Simulators
Nicol\'as Astorga, Anita Kriz, and Mihaela van der Schaar

TL;DR
CauSim introduces a scalable framework for causal reasoning by constructing complex, verifiable causal simulators using LLMs, transforming scarce-label problems into supervised learning tasks across multiple representations.
Contribution
The paper presents CauSim, a novel method for building scalable, verifiable causal simulators with LLMs, enabling improved causal reasoning and data augmentation across representations.
Findings
CauSim enables generalization across different causal representations.
Scaling with curriculum and data volume improves LLM causal reasoning.
Self-generated simulators facilitate LLM self-improvement.
Abstract
Despite surpassing human performance across mathematics, coding, and other knowledge-intensive tasks, large language models (LLMs) continue to struggle with causal reasoning. A core obstacle is the target data itself: causal systems are complex and often expressed in non-executable forms, while ground-truth answers to causal queries are inherently scarce. We introduce CauSim, a framework that turns causal reasoning from a scarce-label problem into a scalable supervised one. CauSim constructs increasingly complex causal simulators: executable structural causal models (SCMs), incrementally built by LLMs, that scale to globally complex systems while maintaining verifiable answers to causal queries. CauSim operates across representations by formalizing non-executable causal knowledge into code, enabling data augmentation, and translating executable SCMs into natural language, enabling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
