Empirical Bayes Multistage Testing for Large-Scale Experiments
Hui Xu, Weinan Wang

TL;DR
This paper introduces AMSET, an adaptive multistage empirical Bayes testing method that leverages historical data for large-scale A/B testing, improving efficiency while controlling false discovery rates.
Contribution
It presents a novel empirical Bayes approach for multistage testing that incorporates historical data, offering robustness and efficiency in large-scale experiments.
Findings
AMSET achieves efficiency gains over traditional methods.
The method maintains marginal FDR control despite peeking.
Robust performance demonstrated on real and simulated data.
Abstract
Modern application of A/B tests is challenging due to its large scale in various dimensions, which demands flexibility to deal with multiple testing sequentially. The state-of-the-art practice first reduces the observed data stream to always-valid p-values, and then chooses a cut-off as in conventional multiple testing schemes. Here we propose an alternative method called AMSET (adaptive multistage empirical Bayes test) by incorporating historical data in decision-making to achieve efficiency gains while retaining marginal false discovery rate (mFDR) control that is immune to peeking. We also show that a fully data-driven estimation in AMSET performs robustly to various simulation and real data settings at a large mobile app social network company.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods in Clinical Trials · Statistical Methods and Bayesian Inference · Statistical Methods and Inference
