A Statistically Reliable Optimization Framework for Bandit Experiments in Scientific Discovery

Tong Li; Travis Mandel; Goldie Phillips; Anna Rafferty; Eric M. Schwartz; Dehan Kong; and Joseph J. Williams

arXiv:2603.11267·stat.AP·March 18, 2026

A Statistically Reliable Optimization Framework for Bandit Experiments in Scientific Discovery

Tong Li, Travis Mandel, Goldie Phillips, Anna Rafferty, Eric M. Schwartz, Dehan Kong, and Joseph J. Williams

PDF

Open Access

TL;DR

This paper introduces a statistically reliable optimization framework for adaptive bandit experiments in scientific research, improving power and validity while balancing reward and statistical efficiency.

Contribution

It develops correction methods for hypothesis testing under adaptive sampling and proposes a unified framework to optimize experiment design balancing reward and statistical power.

Findings

01

Higher statistical power than existing methods in simulations

02

Enables valid hypothesis testing with adaptive sampling

03

Practitioners can improve outcomes with minimal additional steps

Abstract

Scientific experimentation is largely driven by statistical hypothesis testing to determine significant differences in interventions. Traditionally, experimenters allocate samples uniformly between each intervention. However, such an approach may lead to suboptimal outcomes - multi-armed bandits (MABs) addresses this problem by allocating samples adaptively to maximize outcomes. Yet, two challenges have hindered the use of MABs in scientific domains. First, common hypothesis tests (e.g., $t$ -tests) become invalid under adaptive sampling without correction, leading to inflated type~I and type~II errors. This is an understudied problem, and prior solutions suffer from issues such as low statistical power which prevent adoption in many practical settings. Second, practitioners must explicitly balance cumulative reward with statistical efficiency, yet no general methodology exists to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Advanced Causal Inference Techniques · Machine Learning and Algorithms