Interventional Causal Representation Learning
Kartik Ahuja, Divyat Mahajan, Yixin Wang, Yoshua Bengio

TL;DR
This paper demonstrates that interventional data can be used to identify causal latent factors in representation learning, providing theoretical guarantees without distributional assumptions.
Contribution
It proves that perfect and imperfect interventional data enable provable identification of latent causal factors up to permutation and scaling.
Findings
Identification of latent factors up to permutation and scaling with perfect interventions
Block affine identification with imperfect interventions
No assumptions on distributions or dependency structures needed
Abstract
Causal representation learning seeks to extract high-level latent factors from low-level sensory data. Most existing methods rely on observational data and structural assumptions (e.g., conditional independence) to identify the latent factors. However, interventional data is prevalent across applications. Can interventional data facilitate causal representation learning? We explore this question in this paper. The key observation is that interventional data often carries geometric signatures of the latent factors' support (i.e. what values each latent can possibly take). For example, when the latent factors are causally connected, interventions can break the dependency between the intervened latents' support and their ancestors'. Leveraging this fact, we prove that the latent causal factors can be identified up to permutation and scaling given data from perfect interventions.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsBayesian Modeling and Causal Inference · Explainable Artificial Intelligence (XAI) · Domain Adaptation and Few-Shot Learning
