Design-based theory for cluster rerandomization
Xin Lu, Tianle Liu, Hanzhong Liu, Peng Ding

TL;DR
This paper develops a new design-based theoretical framework for cluster rerandomization, addressing the gap in existing asymptotic theory and comparing two schemes based on covariate importance, with practical recommendations for analysis.
Contribution
It introduces the first design-based theory for cluster rerandomization and compares two covariate balancing schemes, providing insights into their optimal use and analysis procedures.
Findings
Weighted Euclidean distance scheme outperforms Mahalanobis with tiers
Optimal weights and orthogonalized covariates improve balance
Recommended covariate-adjusted analysis procedures
Abstract
Complete randomization balances covariates on average, but covariate imbalance often exists in finite samples. Rerandomization can ensure covariate balance in the realized experiment by discarding the undesired treatment assignments. Many field experiments in public health and social sciences assign the treatment at the cluster level due to logistical constraints or policy considerations. Moreover, they are frequently combined with rerandomization in the design stage. We refer to cluster rerandomization as a cluster-randomized experiment compounded with rerandomization to balance covariates at the individual or cluster level. Existing asymptotic theory can only deal with rerandomization with treatments assigned at the individual level, leaving that for cluster rerandomization an open problem. To fill the gap, we provide a design-based theory for cluster rerandomization. Moreover, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods in Clinical Trials · Advanced Causal Inference Techniques · Statistical Methods and Bayesian Inference
