Network-aware IV Regression for Causal Node Discovery and Estimation
Samhita Pal, Dhrubajyoti Ghosh

TL;DR
This paper introduces a novel two-stage regression method that combines instrumental variables and graph-based regularization to improve causal effect estimation in network-structured high-dimensional data, handling invalid instruments and providing theoretical guarantees.
Contribution
It develops a new framework integrating IV regression with graph regularization for causal discovery, addressing latent confounding and invalid instruments in structured data.
Findings
Improved causal variable selection accuracy over existing methods.
Theoretical non-asymptotic guarantees for estimation and selection.
Successful application to brain imaging and genetic data identifying causal regions.
Abstract
Estimating causal effects from high-dimensional, structured exposures is a fundamental challenge in modern applications ranging from neuroscience and finance to environmental science. While the literature has addressed high-dimensional instrumental variable (IV) regression, and separately leveraged graph structure in penalized regression, the integration of both, especially for causal support recovery in the presence of latent confounding, remains unexplored. In this work, we propose a novel two-stage regression framework that incorporates instrumental variables and graph-based regularization to uncover sparse causal effects among network-structured exposures. Our method accommodates both valid and partially invalid instruments, and encourages structural similarity among connected predictors through a graph-fused penalty. We establish non-asymptotic guarantees for estimation accuracy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
