TL;DR
This paper evaluates the performance of machine learning methods for causal effect estimation, highlighting their limitations and identifying approaches that improve bias and coverage in complex confounding scenarios.
Contribution
It demonstrates that ML-based double-robust estimators can outperform single-robust ones, especially when combined with sample splitting and rich modeling.
Findings
Double-robust estimators had less bias than single-robust ones.
ML algorithms with sample splitting and interactions achieved nominal coverage.
Single-robust ML estimators performed poorly in complex confounding scenarios.
Abstract
Unlike parametric regression, machine learning (ML) methods do not generally require precise knowledge of the true data generating mechanisms. As such, numerous authors have advocated for ML methods to estimate causal effects. Unfortunately, ML algorithms can perform worse than parametric regression. We demonstrate the performance of ML-based single- and double-robust estimators. We use 100 Monte Carlo samples with sample sizes of 200, 1200, and 5000 to investigate bias and confidence interval coverage under several scenarios. In a simple confounding scenario, confounders were related to the treatment and the outcome via parametric models. In a complex confounding scenario, the simple confounders were transformed to induce complicated nonlinear relationships. In the simple scenario, when ML algorithms were used, double-robust estimators were superior to single-robust estimators. In the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
