Modular Learning of Deep Causal Generative Models for High-dimensional Causal Inference
Md Musfiqur Rahman, Murat Kocaoglu

TL;DR
This paper introduces Modular-DCM, a novel deep causal generative modeling approach that efficiently leverages pre-trained models for high-dimensional causal inference, addressing the challenge of high-dimensional data such as images.
Contribution
It presents the first modular training algorithm that uses adversarial training with pre-trained models for causal inference in high-dimensional settings, including latent confounders.
Findings
Outperforms baselines on Colored-MNIST dataset
Demonstrates convergence on COVIDx dataset
Shows utility in causal invariant prediction on CelebA-HQ
Abstract
Sound and complete algorithms have been proposed to compute identifiable causal queries using the causal structure and data. However, most of these algorithms assume accurate estimation of the data distribution, which is impractical for high-dimensional variables such as images. On the other hand, modern deep generative architectures can be trained to sample from high-dimensional distributions. However, training these networks are typically very costly. Thus, it is desirable to leverage pre-trained models to answer causal queries using such high-dimensional data. To address this, we propose modular training of deep causal generative models that not only makes learning more efficient, but also allows us to utilize large, pre-trained conditional generative models. To the best of our knowledge, our algorithm, Modular-DCM is the first algorithm that, given the causal structure, uses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference · Domain Adaptation and Few-Shot Learning
