Many Experiments, Few Repetitions, Unpaired Data, and Sparse Effects: Is Causal Inference Possible?
Felix Schur, Niklas Pfister, Peng Ding, Sach Mukherjee, Jonas Peters

TL;DR
This paper investigates the challenge of estimating causal effects with hidden confounding using unpaired data across multiple environments, proposing a GMM estimator that remains consistent with many environments but few observations per environment.
Contribution
It introduces a novel GMM-based estimator for causal inference in unpaired data settings with many environments and few observations, extending to sparse effects with regularization.
Findings
The proposed estimator is consistent as the number of environments increases.
The method effectively estimates sparse causal effects using regularization.
Standard two-sample IV estimators fail under many environments with limited data.
Abstract
We study the problem of estimating causal effects under hidden confounding in the following unpaired data setting: we observe some covariates and an outcome under different experimental conditions (environments) but do not observe them jointly; we either observe or . Under appropriate regularity conditions, the problem can be cast as an instrumental variable (IV) regression with the environment acting as a (possibly high-dimensional) instrument. When there are many environments but only a few observations per environment, standard two-sample IV estimators fail to be consistent. We propose a GMM-type estimator based on cross-fold sample splitting of the instrument-covariate sample and prove that it is consistent as the number of environments grows but the sample size per environment remains constant. We further extend the method to sparse causal effects via…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques · Statistical Methods and Inference · Statistical Methods and Bayesian Inference
