To adjust or not to adjust? Estimating the average treatment effect in randomized experiments with missing covariates
Anqi Zhao, Peng Ding

TL;DR
This paper investigates whether and how to adjust for missing covariates in randomized experiments to improve the estimation of the average treatment effect, emphasizing the use of missingness indicators.
Contribution
It introduces and advocates for the missingness-indicator method, providing theoretical analysis and modifications within a design-based framework for better estimation efficiency.
Findings
Missingness indicators act as fully observed covariates if missingness is unaffected by treatment.
Adding missingness indicators to regression adjustment improves estimation efficiency.
The proposed modifications enhance finite-sample and asymptotic performance.
Abstract
Complete randomization allows for consistent estimation of the average treatment effect based on the difference in means of the outcomes without strong modeling assumptions on the outcome-generating process. Appropriate use of the pretreatment covariates can further improve the estimation efficiency. However, missingness in covariates is common in experiments and raises an important question: should we adjust for covariates subject to missingness, and if so, how? The unadjusted difference in means is always unbiased. The complete-covariate analysis adjusts for all completely observed covariates and improves the efficiency of the difference in means if at least one completely observed covariate is predictive of the outcome. Then what is the additional gain of adjusting for covariates subject to missingness? A key insight is that the missingness indicators act as fully observed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Bayesian Inference · Advanced Causal Inference Techniques · Statistical Methods in Clinical Trials
