Covariate Balance in Simple, Stratified and Clustered Comparative Studies
Ben B. Hansen, Jake Bowers

TL;DR
This paper explores methods for assessing covariate balance in randomized studies, comparing descriptive and significance tests, and applying Fisher's randomization inference to improve balance evaluation, especially in clustered designs.
Contribution
It introduces a framework using Fisher's randomization inference to evaluate covariate balance, challenging previous conclusions and addressing complexities in stratified and clustered experiments.
Findings
Revealed potential misinterpretations in previous balance assessments
Provided new methods for reliable balance testing in moderate samples
Highlighted the impact of clustering on balance evaluation
Abstract
In randomized experiments, treatment and control groups should be roughly the same--balanced--in their distributions of pretreatment variables. But how nearly so? Can descriptive comparisons meaningfully be paired with significance tests? If so, should there be several such tests, one for each pretreatment variable, or should there be a single, omnibus test? Could such a test be engineered to give easily computed -values that are reliable in samples of moderate size, or would simulation be needed for reliable calibration? What new concerns are introduced by random assignment of clusters? Which tests of balance would be optimal? To address these questions, Fisher's randomization inference is applied to the question of balance. Its application suggests the reversal of published conclusions about two studies, one clinical and the other a field experiment in political participation.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
