Sparse Causal Effect Estimation using Two-Sample Summary Statistics in the Presence of Unmeasured Confounding
Shimeng Huang, Niklas Pfister, Jack Bowden

TL;DR
This paper introduces spaceTSIV, a novel sparse causal effect estimator for two-sample summary statistics in genetic epidemiology, addressing unmeasured confounding and high-dimensional covariates.
Contribution
It develops a new sparse two-sample IV estimator, spaceTSIV, with proven identifiability and consistency, tailored for high-dimensional causal inference in summary statistic data.
Findings
spaceTSIV outperforms existing methods in simulations
Proven identifiability and consistency of the proposed estimator
Applied to real data for drug-target discovery
Abstract
Observational genome-wide association studies are now widely used for causal inference in genetic epidemiology. To maintain privacy, such data is often only publicly available as summary statistics, and often studies for the endogenous covariates and the outcome are available separately. This has necessitated methods tailored to two-sample summary statistics. Current state-of-the-art methods modify linear instrumental variable (IV) regression -- with genetic variants as instruments -- to account for unmeasured confounding. However, since the endogenous covariates can be high dimensional, standard IV assumptions are generally insufficient to identify all causal effects simultaneously. We ensure identifiability by assuming the causal effects are sparse and propose a sparse causal effect two-sample IV estimator, spaceTSIV, adapting the spaceIV estimator by Pfister and Peters (2022) for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques
