Recursive Partitioning for Heterogeneous Causal Effects
Susan Athey, Guido Imbens

TL;DR
This paper introduces a data-driven recursive partitioning method to identify and test heterogeneity in causal treatment effects across subpopulations, addressing challenges in causal inference and model validation.
Contribution
It develops novel cross-validation criteria tailored for predicting causal effects, enabling detection of treatment heterogeneity without invalidating inference.
Findings
Method effectively identifies subpopulations with different treatment effects.
Proposed cross-validation criteria outperform standard methods in simulations.
Application to search engine experiment demonstrates practical utility.
Abstract
In this paper we study the problems of estimating heterogeneity in causal effects in experimental or observational studies and conducting inference about the magnitude of the differences in treatment effects across subsets of the population. In applications, our method provides a data-driven approach to determine which subpopulations have large or small treatment effects and to test hypotheses about the differences in these effects. For experiments, our method allows researchers to identify heterogeneity in treatment effects that was not specified in a pre-analysis plan, without concern about invalidating inference due to multiple testing. In most of the literature on supervised machine learning (e.g. regression trees, random forests, LASSO, etc.), the goal is to build a model of the relationship between a unit's attributes and an observed outcome. A prominent role in these methods is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques · Statistical Methods and Inference · Statistical Methods and Bayesian Inference
