Comment on "Generic machine learning inference on heterogeneous   treatment effects in randomized experiments."

Kosuke Imai; Michael Lingzhi Li

arXiv:2502.06758·stat.ME·February 11, 2025

Comment on "Generic machine learning inference on heterogeneous treatment effects in randomized experiments."

Kosuke Imai, Michael Lingzhi Li

PDF

Open Access

TL;DR

This paper proposes an alternative randomization inference method for heterogeneous treatment effect estimation that reduces computational costs while maintaining statistical validity, improving practicality for large-scale applications.

Contribution

It introduces a new RI approach that avoids repeated data splitting, leveraging cross-fitting and design-based inference for efficient, valid uncertainty quantification.

Findings

01

RI retains statistical efficiency similar to SSRI

02

RI significantly reduces computational costs

03

Simulation results confirm RI's practical advantages

Abstract

We analyze the split-sample robust inference (SSRI) methodology proposed by Chernozhukov, Demirer, Duflo, and Fernandez-Val (CDDF) for quantifying uncertainty in heterogeneous treatment effect estimation. While SSRI effectively accounts for randomness in data splitting, its computational cost can be prohibitive when combined with complex machine learning (ML) models. We present an alternative randomization inference (RI) approach that maintains SSRI's generality without requiring repeated data splitting. By leveraging cross-fitting and design-based inference, RI achieves valid confidence intervals while significantly reducing computational burden. We compare the two methods through simulation, demonstrating that RI retains statistical efficiency while being more practical for large-scale applications.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference