Use What You Know: Causal Foundation Models with Partial Graphs
Arik Reuter, Anish Dhir, Cristiana Diaconu, Jake Robertson, Ole Ossen, Frank Hutter, Adrian Weller, Mark van der Wilk, Bernhard Sch\"olkopf

TL;DR
This paper introduces methods to incorporate causal domain knowledge into Causal Foundation Models (CFMs), enabling them to utilize partial or complete causal information for improved causal inference.
Contribution
It proposes conditioning strategies, especially learnable biases in attention, to integrate causal knowledge into CFMs, enhancing their flexibility and performance.
Findings
Learnable biases in attention effectively utilize causal information.
Conditioned CFMs match specialized models' performance.
Partial causal information can be effectively leveraged.
Abstract
Estimating causal quantities traditionally relies on bespoke estimators tailored to specific assumptions. Recently proposed Causal Foundation Models (CFMs) promise a more unified approach by amortising causal discovery and inference in a single step. However, in their current state, they do not allow for the incorporation of any domain knowledge, which can lead to suboptimal predictions. We bridge this gap by introducing methods to condition CFMs on causal information, such as the causal graph or more readily available ancestral information. When access to complete causal graph information is too strict a requirement, our approach also effectively leverages partial causal information. We systematically evaluate conditioning strategies and find that injecting learnable biases into the attention mechanism is the most effective method to utilise full and partial causal information. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Explainable Artificial Intelligence (XAI) · Advanced Graph Neural Networks
