# Leveraging Random Assignment to Impute Missing Covariates in Causal   Studies

**Authors:** Gauri Kamat, Jerome P. Reiter

arXiv: 1908.01333 · 2023-08-29

## TL;DR

This paper examines whether leveraging randomized treatment assignment improves the accuracy of imputing missing covariates in experimental studies, finding only small gains in typical scenarios.

## Contribution

It investigates the benefits of using randomization information in imputation methods for missing covariates in randomized experiments, through comprehensive simulation studies.

## Key findings

- Respecting randomization yields small accuracy improvements.
- Imputation methods using covariates or outcomes show similar performance.
- Guidance on imputing missing covariates for heterogeneous treatment effect estimation.

## Abstract

Baseline covariates in randomized experiments are often used in the estimation of treatment effects, for example, when estimating treatment effects within covariate-defined subgroups. In practice, however, covariate values may be missing for some data subjects. To handle missing values, analysts can use imputation methods to create completed datasets, from which they can estimate treatment effects. Common imputation methods include mean imputation, single imputation via regression, and multiple imputation. For each of these methods, we investigate the benefits of leveraging randomized treatment assignment in the imputation routines, that is, making use of the fact that the true covariate distributions are the same across treatment arms. We do so using simulation studies that compare the quality of inferences when we respect or disregard the randomization. We consider this question for imputation routines implemented using covariates only, and imputation routines implemented using the outcome variable. In either case, accounting for randomization offers only small gains in accuracy for our simulation scenarios. Our results also shed light on the performances of these different procedures for imputing missing covariates in randomized experiments when one seeks to estimate heterogeneous treatment effects.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1908.01333/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/1908.01333/full.md

## References

45 references — full list in the complete paper: https://tomesphere.com/paper/1908.01333/full.md

---
Source: https://tomesphere.com/paper/1908.01333