Targeted Deep Architectures: A TMLE-Based Framework for Robust Causal Inference in Neural Networks
Yi Li, David Mccoy, Nolan Gunter, Kaitlyn Lee, Alejandro Schuler, Mark van der Laan

TL;DR
This paper introduces Targeted Deep Architectures (TDA), a novel framework embedding TMLE into neural networks to enable valid causal inference with asymptotic guarantees and improved empirical performance.
Contribution
TDA integrates TMLE directly into neural network training, allowing for unbiased, efficient causal estimates and extending to complex multi-parameter causal targets.
Findings
Reduces bias in treatment effect estimation.
Improves confidence interval coverage.
Extends to multi-dimensional causal estimands.
Abstract
Modern deep neural networks are powerful predictive tools yet often lack valid inference for causal parameters, such as treatment effects or entire survival curves. While frameworks like Double Machine Learning (DML) and Targeted Maximum Likelihood Estimation (TMLE) can debias machine-learning fits, existing neural implementations either rely on "targeted losses" that do not guarantee solving the efficient influence function equation or computationally expensive post-hoc "fluctuations" for multi-parameter settings. We propose Targeted Deep Architectures (TDA), a new framework that embeds TMLE directly into the network's parameter space with no restrictions on the backbone architecture. Specifically, TDA partitions model parameters - freezing all but a small "targeting" subset - and iteratively updates them along a targeting gradient, derived from projecting the influence functions onto…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Fault Detection and Control Systems · Adversarial Robustness in Machine Learning
