Amortized Active Causal Induction with Deep Reinforcement Learning
Yashas Annadani, Panagiotis Tigas, Stefan Bauer, Adam Foster

TL;DR
This paper introduces CAASL, a reinforcement learning-based method using transformers for adaptive causal structure learning that generalizes well across environments and intervention types, improving causal graph estimation.
Contribution
The paper proposes a novel amortized intervention design policy using transformers trained with reinforcement learning for causal graph discovery, capable of zero-shot generalization.
Findings
Outperforms alternative strategies in causal graph estimation on synthetic data.
Achieves effective zero-shot generalization to higher-dimensional environments.
Successfully generalizes to unseen intervention types during testing.
Abstract
We present Causal Amortized Active Structure Learning (CAASL), an active intervention design policy that can select interventions that are adaptive, real-time and that does not require access to the likelihood. This policy, an amortized network based on the transformer, is trained with reinforcement learning on a simulator of the design environment, and a reward function that measures how close the true causal graph is to a causal graph posterior inferred from the gathered data. On synthetic data and a single-cell gene expression simulator, we demonstrate empirically that the data acquired through our policy results in a better estimate of the underlying causal graph than alternative strategies. Our design policy successfully achieves amortized intervention design on the distribution of the training environment while also generalizing well to distribution shifts in test-time design…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Machine Learning and Algorithms · Computability, Logic, AI Algorithms
