Amortized Active Causal Induction with Deep Reinforcement Learning

Yashas Annadani; Panagiotis Tigas; Stefan Bauer; Adam Foster

arXiv:2405.16718 · cs.LG · May 28, 2024 · 1 citation

TL;DR

This paper introduces CAASL, a transformer-based policy trained with reinforcement learning that adaptively selects interventions for causal structure learning. The data it gathers yields better causal graph estimates than alternative strategies, and the policy generalizes across environments and intervention types.

Contribution

The paper proposes an amortized intervention design policy for causal graph discovery: a transformer trained with reinforcement learning on a simulator of the design environment, capable of zero-shot generalization.

Findings

1. Outperforms alternative strategies in causal graph estimation on synthetic data and a single-cell gene expression simulator.

2. Achieves effective zero-shot generalization to higher-dimensional environments.

3. Generalizes to intervention types not seen during training.

Abstract

We present Causal Amortized Active Structure Learning (CAASL), an active intervention design policy that can select interventions that are adaptive, real-time and that does not require access to the likelihood. This policy, an amortized network based on the transformer, is trained with reinforcement learning on a simulator of the design environment, and a reward function that measures how close the true causal graph is to a causal graph posterior inferred from the gathered data. On synthetic data and a single-cell gene expression simulator, we demonstrate empirically that the data acquired through our policy results in a better estimate of the underlying causal graph than alternative strategies. Our design policy successfully achieves amortized intervention design on the distribution of the training environment while also generalizing well to distribution shifts in test-time design…
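The abstract describes the reward only as a measure of how close the true causal graph is to a graph posterior inferred from the gathered data. One way such a reward could look is sketched below. This is an illustrative assumption, not the paper's definition: the function name `graph_closeness_reward`, the representation of the posterior as a matrix of marginal edge probabilities, and the expected-agreement formula are all hypothetical choices made for this example.

```python
from typing import Sequence

Matrix = Sequence[Sequence[float]]


def graph_closeness_reward(true_adj: Matrix, edge_probs: Matrix) -> float:
    """Expected number of off-diagonal entries on which a graph sampled
    from the posterior marginals agrees with the true adjacency matrix.

    Hypothetical reward for illustration only; the paper's exact reward
    is not given in the abstract.
    """
    d = len(true_adj)
    rows_ok = all(len(r) == d for r in true_adj) and all(len(r) == d for r in edge_probs)
    if len(edge_probs) != d or not rows_ok:
        raise ValueError("both inputs must be square matrices of the same size")

    reward = 0.0
    for i in range(d):
        for j in range(d):
            if i == j:
                continue  # self-loops cannot occur in a DAG, so skip the diagonal
            a, p = true_adj[i][j], edge_probs[i][j]
            # Probability that the posterior matches the truth for entry (i, j):
            # p if the edge exists, 1 - p if it does not.
            reward += a * p + (1 - a) * (1 - p)
    return reward


# True graph on three nodes: 0 -> 1 -> 2.
true_graph = [[0, 1, 0], [0, 0, 1], [0, 0, 0]]
uninformed = [[0.5] * 3 for _ in range(3)]  # posterior that knows nothing

print(graph_closeness_reward(true_graph, true_graph))  # 6.0, posterior equals the truth
print(graph_closeness_reward(true_graph, uninformed))  # 3.0, chance-level agreement
```

Under this sketch, a policy whose interventions sharpen the posterior toward the true graph moves the reward from the chance level of 3.0 toward the maximum of 6.0 for three nodes, which is the kind of signal a reinforcement learning objective could use.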

Taxonomy

Topics: Neural Networks and Applications · Machine Learning and Algorithms · Computability, Logic, AI Algorithms