Inducing Causal World Models in LLMs for Zero-Shot Physical Reasoning
Aditya Sharma, Ananya Gupta, Chengyu Wang, Chiamaka Adebayo, and Jakub Kowalski

TL;DR
This paper presents CWMI, a framework that embeds causal physics into LLMs, enabling better zero-shot physical reasoning by learning cause-and-effect relationships through a dedicated module and intervention-based training.
Contribution
Introduces Causal World Model Induction (CWMI), integrating a causal physics module and intervention loss to enhance LLMs' understanding of physical dynamics.
Findings
CWMI outperforms existing LLMs on zero-shot physical reasoning benchmarks.
The model effectively learns cause-and-effect relationships from multimodal data.
Experimental results validate the importance of causal modeling for physical reasoning.
Abstract
Large Language Models (LLMs), despite their advanced linguistic capabilities, fundamentally lack an intuitive understanding of physical dynamics, which limits their effectiveness in real-world scenarios that require causal reasoning. In this paper, we introduce Causal World Model Induction (CWMI), a novel framework designed to embed an explicit model of causal physics within an LLM. Our approach incorporates a dedicated Causal Physics Module (CPM) and a new training objective called Causal Intervention Loss, encouraging the model to learn cause-and-effect relationships from multimodal data. By training the model to predict the outcomes of hypothetical interventions instead of merely capturing statistical correlations, CWMI develops a robust internal representation of physical laws. Experimental results show that CWMI significantly outperforms state-of-the-art LLMs on zero-shot physical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference · Multimodal Machine Learning Applications
