Leveraging machine learning to estimate individualized treatment effects in cluster-randomized trials
Changjun Li, Xi Fang, Michael O. Harhay, Andrew B. Forbes, F. Perry Wilson, Guangyu Tong, Fan Li

TL;DR
This paper develops a unified mixed-effects machine learning framework to estimate individualized treatment effects in cluster-randomized trials, accounting for both individual and cluster-level covariates.
Contribution
It introduces a comprehensive approach combining various machine learning methods with mixed-effects models for causal inference in CRTs.
Findings
Methods evaluated across diverse simulations.
Application demonstrated in Ghana hypertension trial.
Provided practical guidance and reproducible code.
Abstract
Cluster-randomized trials (CRTs) are widely used to evaluate interventions delivered at the clinic, practice, or community level. Although standard analyses typically target average treatment effects, such summaries mask potentially meaningful variation in treatment response across individuals and clusters. This work addresses the estimation of conditional average treatment effects (CATEs) for continuous outcomes in two-arm parallel CRTs by defining causal estimands that incorporate both individual- and cluster-level baseline covariates while marginalizing over unobserved cluster heterogeneity. To estimate these quantities, we develop a unified framework based on mixed-effects machine learning, integrating and extending a range of existing approaches, including Bayesian additive regression trees with random effects, multilevel Bayesian causal forests, mixed-effects random forests,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
