How to manage missing covariates in randomized controlled trials: a comparison of strategies
Shiyu Zhang, Yajuan Si, John J. Dziak

TL;DR
This paper compares strategies for handling missing data in RCTs, showing which methods produce unbiased treatment effect estimates under different conditions.
Contribution
The paper clarifies how different missing data strategies perform in RCTs and reconciles conflicting recommendations in the literature.
Findings
MI by arm provides unbiased estimates for both average and subgroup treatment effects under MAR.
Baseline-only MI, grand mean imputation, and missing indicator method yield unbiased average treatment effects but biased subgroup effects.
Simple strategies can perform well for primary analyses but fail for secondary analyses under certain conditions.
Abstract
When analyzing randomized controlled trials (RCTs) data, covariate adjustment is often employed to increase the precision of estimated treatment effects. Missing data in covariates, if not handled properly, can result in biased and inefficient estimates. However, the existing literature on handling missing covariate data is limited, and recommendations vary regarding a valid and efficient approach. To help reconcile the seemingly inconsistent recommendations, we address two questions through methodological descriptions and simulated demonstrations. First, how should a multiple imputation (MI) model be specified for RCTs to best preserve the benefit of the randomization design? We consider three different approaches: MI with only baseline variables, “MI overall”, and “MI by arm”. Second, when and why will simple general strategies, such as grand mean imputation and the missing indicator…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Bayesian Inference · Advanced Causal Inference Techniques · Statistical Methods in Clinical Trials
