Counterfactual Prediction with Deep Instrumental Variables Networks
Jason Hartford, Greg Lewis, Kevin Leyton-Brown, Matt Taddy

TL;DR
This paper introduces Deep IV, a deep learning framework for causal inference with instrumental variables, combining neural networks for treatment prediction and outcome modeling, enabling flexible and scalable causal effect estimation.
Contribution
It proposes a novel deep learning approach for instrumental variable analysis, integrating treatment and outcome prediction into a unified neural network framework.
Findings
Deep IV effectively estimates causal effects using neural networks.
The framework supports Bayesian and frequentist inference methods.
It avoids extensive algorithm customization by leveraging off-the-shelf ML tools.
Abstract
We are in the middle of a remarkable rise in the use and capability of artificial intelligence. Much of this growth has been fueled by the success of deep learning architectures: models that map from observables to outputs via multiple layers of latent representations. These deep learning algorithms are effective tools for unstructured prediction, and they can be combined in AI systems to solve complex automated reasoning problems. This paper provides a recipe for combining ML algorithms to solve for causal effects in the presence of instrumental variables -- sources of treatment randomization that are conditionally independent from the response. We show that a flexible IV specification resolves into two prediction tasks that can be solved with deep neural nets: a first-stage network for treatment prediction and a second-stage network whose loss function involves integration over the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Domain Adaptation and Few-Shot Learning
