Distilling interpretable causal trees from causal forests

Patrick Rehill

arXiv:2408.01023·econ.EM·August 5, 2024

Distilling interpretable causal trees from causal forests

Patrick Rehill

PDF

Open Access

TL;DR

This paper introduces the Distilled Causal Tree, an interpretable model that extracts a single, understandable causal tree from a complex causal forest, outperforming existing methods especially in noisy or high-dimensional data.

Contribution

It presents a novel method for distilling a single interpretable causal tree from a causal forest, improving interpretability without sacrificing accuracy in complex data settings.

Findings

01

Outperforms existing tree extraction methods in noisy and high-dimensional data

02

Estimates are doubly robust and asymptotically normal

03

Provides clearer insights from complex causal forests

Abstract

Machine learning methods for estimating treatment effect heterogeneity promise greater flexibility than existing methods that test a few pre-specified hypotheses. However, one problem these methods can have is that it can be challenging to extract insights from complicated machine learning models. A high-dimensional distribution of conditional average treatment effects may give accurate, individual-level estimates, but it can be hard to understand the underlying patterns; hard to know what the implications of the analysis are. This paper proposes the Distilled Causal Tree, a method for distilling a single, interpretable causal tree from a causal forest. This compares well to existing methods of extracting a single tree, particularly in noisy data or high-dimensional data where there are many correlated features. Here it even outperforms the base causal forest in most simulations. Its…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference

MethodsBalanced Selection