Multi-armed Bandit Models for the Optimal Design of Clinical Trials: Benefits and Challenges
Sof\'ia S. Villar, Jack Bowden, James Wason

TL;DR
This paper reviews multi-armed bandit models for clinical trial design, highlighting their benefits in patient allocation and proposing a new rule to improve statistical power, thus bridging theory and practice.
Contribution
It introduces a novel bandit-based patient allocation rule that addresses low statistical power, facilitating practical application in clinical trials.
Findings
Bandit models assign more patients to better treatments.
They have limitations in statistical power.
A new allocation rule improves power in clinical trials.
Abstract
Multi-armed bandit problems (MABPs) are a special type of optimal control problem well suited to model resource allocation under uncertainty in a wide variety of contexts. Since the first publication of the optimal solution of the classic MABP by a dynamic index rule, the bandit literature quickly diversified and emerged as an active research topic. Across this literature, the use of bandit models to optimally design clinical trials became a typical motivating application, yet little of the resulting theory has ever been used in the actual design and analysis of clinical trials. To this end, we review two MABP decision-theoretic approaches to the optimal allocation of treatments in a clinical trial: the infinite-horizon Bayesian Bernoulli MABP and the finite-horizon variant. These models possess distinct theoretical properties and lead to separate allocation rules in a clinical trial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
