Estimating Causal Effects with Double Machine Learning -- A Method Evaluation
Jonathan Fuhr, Philipp Berens, Dominik Papies

TL;DR
This paper reviews and empirically evaluates double machine learning (DML) for causal effect estimation, demonstrating its advantages in handling nonlinear confounding and providing practical guidance for its application.
Contribution
It offers a comprehensive review and empirical comparison of DML against traditional methods, highlighting its strengths and limitations in causal inference.
Findings
DML improves adjustment for nonlinear confounding.
DML estimates of air pollution effects are larger than traditional methods.
Application of flexible ML algorithms enhances causal effect estimation.
Abstract
The estimation of causal effects with observational data continues to be a very active research area. In recent years, researchers have developed new frameworks which use machine learning to relax classical assumptions necessary for the estimation of causal effects. In this paper, we review one of the most prominent methods - "double/debiased machine learning" (DML) - and empirically evaluate it by comparing its performance on simulated data relative to more traditional statistical methods, before applying it to real-world data. Our findings indicate that the application of a suitably flexible machine learning algorithm within DML improves the adjustment for various nonlinear confounding relationships. This advantage enables a departure from traditional functional form assumptions typically necessary in causal effect estimation. However, we demonstrate that the method continues to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Explainable Artificial Intelligence (XAI)
