Rerandomization for covariate balance mitigates p-hacking in regression adjustment
Xin Lu, Peng Ding

TL;DR
This paper demonstrates that rerandomization in experimental design reduces false positives caused by p-hacking, especially when using strict thresholds, thereby improving the reliability of treatment effect estimates.
Contribution
It provides a theoretical framework showing rerandomization mitigates p-hacking effects and guides threshold selection for practical implementation.
Findings
Rerandomization reduces false discoveries from p-hacking.
Stringent rerandomization thresholds effectively resolve p-hacking.
Guidance on choosing rerandomization thresholds in practice.
Abstract
Rerandomization enforces covariate balance across treatment groups in the design stage of experiments. Despite its intuitive appeal, its theoretical justification remains unsatisfying because its benefits of improving efficiency for estimating the average treatment effect diminish if we use regression adjustment in the analysis stage. To strengthen the theory of rerandomization, we show that it mitigates false discoveries resulting from -hacking, the practice of strategically selecting covariates to get more significant -values. Moreover, we show that rerandomization with a sufficiently stringent threshold can resolve -hacking. As a byproduct, our theory offers guidance for choosing the threshold in rerandomization in practice.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference
