A Statistically Reliable Optimization Framework for Bandit Experiments in Scientific Discovery
Tong Li, Travis Mandel, Goldie Phillips, Anna Rafferty, Eric M. Schwartz, Dehan Kong, and Joseph J. Williams

TL;DR
This paper introduces a statistically reliable optimization framework for adaptive bandit experiments in scientific research, improving power and validity while balancing reward and statistical efficiency.
Contribution
It develops correction methods for hypothesis testing under adaptive sampling and proposes a unified framework to optimize experiment design balancing reward and statistical power.
Findings
Higher statistical power than existing methods in simulations
Enables valid hypothesis testing with adaptive sampling
Practitioners can improve outcomes with minimal additional steps
Abstract
Scientific experimentation is largely driven by statistical hypothesis testing to determine significant differences in interventions. Traditionally, experimenters allocate samples uniformly between each intervention. However, such an approach may lead to suboptimal outcomes - multi-armed bandits (MABs) addresses this problem by allocating samples adaptively to maximize outcomes. Yet, two challenges have hindered the use of MABs in scientific domains. First, common hypothesis tests (e.g., -tests) become invalid under adaptive sampling without correction, leading to inflated type~I and type~II errors. This is an understudied problem, and prior solutions suffer from issues such as low statistical power which prevent adoption in many practical settings. Second, practitioners must explicitly balance cumulative reward with statistical efficiency, yet no general methodology exists to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Advanced Causal Inference Techniques · Machine Learning and Algorithms
