Benchmarking covariate-adjustment strategies for randomized clinical trials
Yulin Shao, Liangbo Lyu, Menggang Yu, Bingkai Wang

TL;DR
This study empirically benchmarks 18 covariate adjustment strategies across 50 RCTs, showing that simple, transparent methods often outperform complex machine-learning approaches in improving statistical efficiency.
Contribution
It provides the first large-scale empirical comparison of covariate adjustment methods, guiding best practices for routine RCT analysis.
Findings
Covariate adjustment reduces variance by median 13.3% for continuous outcomes.
Machine-learning methods with default settings do not outperform simple models.
Parsimonious regression approaches are stable and effective across sample sizes.
Abstract
Covariate adjustment is widely recommended to improve statistical efficiency in randomized clinical trials (RCTs), yet empirical evidence comparing available strategies remains limited. This lack of real-world evaluation leaves unresolved practical questions about which adjustment methods to use and which covariates to include. To address this gap, we conduct a large-scale empirical benchmarking using individual-level data from 50 publicly accessible RCTs comprising 29,094 participants and 574 treatment-outcome pairs. We evaluate 18 analytical strategies formed by combining six estimators-including classical regression, inverse probability weighting, and machine-learning methods-with three covariate-selection rules. Across diverse therapeutic areas, covariate adjustment consistently improves precision, yielding median variance reductions of 13.3% relative to unadjusted analyses for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques · Statistical Methods in Clinical Trials · Statistical Methods and Inference
