CauSim: Scaling Causal Reasoning with Increasingly Complex Causal Simulators

Nicol\'as Astorga; Anita Kriz; and Mihaela van der Schaar

arXiv:2605.09079·cs.AI·May 12, 2026

CauSim: Scaling Causal Reasoning with Increasingly Complex Causal Simulators

Nicol\'as Astorga, Anita Kriz, and Mihaela van der Schaar

PDF

TL;DR

CauSim introduces a scalable framework for causal reasoning by constructing complex, verifiable causal simulators using LLMs, transforming scarce-label problems into supervised learning tasks across multiple representations.

Contribution

The paper presents CauSim, a novel method for building scalable, verifiable causal simulators with LLMs, enabling improved causal reasoning and data augmentation across representations.

Findings

01

CauSim enables generalization across different causal representations.

02

Scaling with curriculum and data volume improves LLM causal reasoning.

03

Self-generated simulators facilitate LLM self-improvement.

Abstract

Despite surpassing human performance across mathematics, coding, and other knowledge-intensive tasks, large language models (LLMs) continue to struggle with causal reasoning. A core obstacle is the target data itself: causal systems are complex and often expressed in non-executable forms, while ground-truth answers to causal queries are inherently scarce. We introduce CauSim, a framework that turns causal reasoning from a scarce-label problem into a scalable supervised one. CauSim constructs increasingly complex causal simulators: executable structural causal models (SCMs), incrementally built by LLMs, that scale to globally complex systems while maintaining verifiable answers to causal queries. CauSim operates across representations by formalizing non-executable causal knowledge into code, enabling data augmentation, and translating executable SCMs into natural language, enabling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.