Identifiability Guarantees for Causal Disentanglement from Purely Observational Data

Ryan Welch; Jiaqi Zhang; Caroline Uhler

arXiv:2410.23620 · cs.LG · December 25, 2024

TL;DR

This paper establishes theoretical guarantees on which latent causal factors can be identified from purely observational data in nonlinear models with additive Gaussian noise and linear mixing, and proposes a practical algorithm for causal disentanglement.

Contribution

It provides the first precise characterization of which latent causal factors can be identified without interventions in nonlinear models with additive Gaussian noise and linear mixing, and introduces a quadratic programming algorithm for recovering them.

Findings

01. Causal variables can be identified up to a layer-wise transformation.

02. Further disentanglement beyond this layer-wise transformation is impossible.

03. The proposed algorithm successfully derives meaningful causal representations from observational data.
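
To make the phrase "layer-wise" more concrete, the sketch below groups the nodes of a small latent DAG into layers. It assumes that a layer is the set of nodes at the same depth, where root nodes have depth 0 and every other node sits one level below its deepest parent. That definition, the function name `dag_layers`, and the example graph are illustrative assumptions; this summary does not state the paper's formal definition.

```python
def dag_layers(parents):
    """Group DAG nodes by depth (assumed notion of 'layer').

    `parents` maps each node to the list of its parent nodes.
    Roots get depth 0; any other node gets 1 + the max depth of its parents.
    """
    depth = {}

    def node_depth(v):
        # Memoized recursion over the parent sets.
        if v not in depth:
            if not parents[v]:
                depth[v] = 0
            else:
                depth[v] = 1 + max(node_depth(p) for p in parents[v])
        return depth[v]

    for v in parents:
        node_depth(v)

    layers = {}
    for v, d in depth.items():
        layers.setdefault(d, []).append(v)
    return [sorted(layers[d]) for d in sorted(layers)]


# Hypothetical latent graph: Z1 -> Z3 <- Z2, Z3 -> Z4, Z1 -> Z5
example_graph = {"Z1": [], "Z2": [], "Z3": ["Z1", "Z2"], "Z4": ["Z3"], "Z5": ["Z1"]}
example_layers = dag_layers(example_graph)
```

Under this reading, the first finding says that variables can be told apart across layers only up to a transformation, and the second says that no observational method can do better.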

Abstract

Causal disentanglement aims to learn about latent causal factors behind data, holding the promise to augment existing representation learning methods in terms of interpretability and extrapolation. Recent advances establish identifiability results assuming that interventions on (single) latent factors are available; however, it remains debatable whether such assumptions are reasonable due to the inherent nature of intervening on latent variables. Accordingly, we reconsider the fundamentals and ask what can be learned using just observational data. We provide a precise characterization of latent factors that can be identified in nonlinear causal models with additive Gaussian noise and linear mixing, without any interventions or graphical restrictions. In particular, we show that the causal variables can be identified up to a layer-wise transformation and that further disentanglement is…
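
The model class named in the abstract (nonlinear causal mechanisms, additive Gaussian noise, linear mixing) can be illustrated with a small simulation. This is a minimal sketch under stated assumptions: the three-node latent graph, the particular nonlinear functions, the function name `sample_observational_data`, and the random mixing matrix `G` are all hypothetical choices made for illustration and are not taken from the paper or its repository.

```python
import numpy as np


def sample_observational_data(n_samples=1000, obs_dim=5, seed=0):
    """Draw observational samples X = G Z from a toy latent causal model.

    Latent variables follow Z_i = f_i(parents of Z_i) + eps_i with
    standard Gaussian eps_i; the observations are a linear mixing of Z.
    """
    rng = np.random.default_rng(seed)

    # Independent Gaussian noise, one column per latent variable.
    eps = rng.normal(size=(n_samples, 3))

    # Hypothetical latent DAG: Z1 -> Z2, Z1 -> Z3, Z2 -> Z3.
    z1 = eps[:, 0]                          # root node: pure noise
    z2 = np.tanh(z1) + eps[:, 1]            # nonlinear in its parent Z1
    z3 = np.sin(z1) * z2 + eps[:, 2]        # nonlinear in parents Z1, Z2
    Z = np.column_stack([z1, z2, z3])

    # Linear mixing: each observed sample is x = G z.
    G = rng.normal(size=(obs_dim, 3))
    X = Z @ G.T
    return X, Z, G


X, Z, G = sample_observational_data()
```

Only `X` would be available to a learner in the observational setting; `Z` and `G` are returned here so that a recovered representation can be compared against the ground truth.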

Code & Models

Repositories

uhlerlab/observational-crl
PyTorch · Official

Videos

Identifiability Guarantees for Causal Disentanglement from Purely Observational Data · SlidesLive

Taxonomy

Topics: Bayesian Modeling and Causal Inference