Challenges in Obtaining Valid Causal Effect Estimates with Machine   Learning Algorithms

Ashley I Naimi; Alan E Mishler; Edward H Kennedy

arXiv:1711.07137·stat.ME·May 15, 2020

Challenges in Obtaining Valid Causal Effect Estimates with Machine Learning Algorithms

Ashley I Naimi, Alan E Mishler, Edward H Kennedy

PDF

2 Repos

TL;DR

This paper evaluates the performance of machine learning methods for causal effect estimation, highlighting their limitations and identifying approaches that improve bias and coverage in complex confounding scenarios.

Contribution

It demonstrates that ML-based double-robust estimators can outperform single-robust ones, especially when combined with sample splitting and rich modeling.

Findings

01

Double-robust estimators had less bias than single-robust ones.

02

ML algorithms with sample splitting and interactions achieved nominal coverage.

03

Single-robust ML estimators performed poorly in complex confounding scenarios.

Abstract

Unlike parametric regression, machine learning (ML) methods do not generally require precise knowledge of the true data generating mechanisms. As such, numerous authors have advocated for ML methods to estimate causal effects. Unfortunately, ML algorithms can perform worse than parametric regression. We demonstrate the performance of ML-based single- and double-robust estimators. We use 100 Monte Carlo samples with sample sizes of 200, 1200, and 5000 to investigate bias and confidence interval coverage under several scenarios. In a simple confounding scenario, confounders were related to the treatment and the outcome via parametric models. In a complex confounding scenario, the simple confounders were transformed to induce complicated nonlinear relationships. In the simple scenario, when ML algorithms were used, double-robust estimators were superior to single-robust estimators. In the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.