Causal Representation Learning from Multiple Distributions: A General Setting
Kun Zhang, Shaoan Xie, Ignavier Ng, Yujia Zheng

TL;DR
This paper addresses the challenge of recovering latent causal variables and their relationships from multiple distributions without relying on parametric assumptions or interventions, providing a general nonparametric framework with theoretical guarantees.
Contribution
It introduces a novel nonparametric approach for causal representation learning from heterogeneous data, recovering causal structures under sparsity and change conditions without hard interventions.
Findings
Recovery of the moralized graph of the causal structure.
Latent variables can be recovered up to component-wise transformations.
Theoretical validation through experimental results.
Abstract
In many problems, the measured variables (e.g., image pixels) are just mathematical functions of the latent causal variables (e.g., the underlying concepts or objects). For the purpose of making predictions in changing environments or making proper changes to the system, it is helpful to recover the latent causal variables and their causal relations represented by graph . This problem has recently been known as causal representation learning. This paper is concerned with a general, completely nonparametric setting of causal representation learning from multiple distributions (arising from heterogeneous data or nonstationary time series), without assuming hard interventions behind distribution changes. We aim to develop general solutions in this fundamental case; as a by product, this helps see the unique benefit offered by other assumptions such as parametric causal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Imbalanced Data Classification Techniques
