Asymptotic Theory of the Best-Choice Rerandomization using the Mahalanobis Distance

Yuhao Wang; Xinran Li

arXiv:2312.02513·stat.ME·September 17, 2025·1 cites

Asymptotic Theory of the Best-Choice Rerandomization using the Mahalanobis Distance

Yuhao Wang, Xinran Li

PDF

Open Access

TL;DR

This paper develops an asymptotic theory for the best-choice rerandomization method using Mahalanobis distance, showing it improves estimator concentration and can achieve near-optimal precision as sample size grows.

Contribution

It provides the first large-sample theoretical analysis of best-choice rerandomization with Mahalanobis distance, including asymptotic distribution and confidence interval construction.

Findings

01

Rerandomization concentrates the estimator around the true effect more than complete randomization.

02

Proposed confidence intervals are shorter and more accurate under rerandomization.

03

Best-choice rerandomization can achieve near-perfect covariate balance with increasing sample size.

Abstract

Rerandomization, a design that utilizes pretreatment covariates and improves their balance between different treatment groups, has received attention recently in both theory and practice. From a survey by Bruhn and McKenzie (2009), there are at least two types of rerandomization that are used in practice: the first rerandomizes the treatment assignment until covariate imbalance is below a prespecified threshold; the second randomizes the treatment assignment multiple times and chooses the one with the best covariate balance. In this paper we will consider the second type of rerandomization, namely the best-choice rerandomization, whose theory and inference are still lacking in the literature. In particular, we will focus on the best-choice rerandomization that uses the Mahalanobis distance to measure covariate imbalance, which is one of the most commonly used imbalance measure for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods in Clinical Trials · Advanced Causal Inference Techniques · Gene expression and cancer classification