Some performance considerations when using multi-armed bandit algorithms in the presence of missing data
Xijin Chen, Kim May Lee, Sofia S. Villar, and David S. Robertson

TL;DR
This study examines how ignoring missing data affects the performance of multi-armed bandit algorithms in clinical trial simulations, revealing that the impact varies with the exploration-exploitation balance and can be mitigated by simple imputation.
Contribution
It provides an extensive simulation analysis of missing data effects on different bandit algorithms, highlighting the importance of handling missingness appropriately in practice.
Findings
Ignoring missing data impacts algorithms differently based on their exploration-exploitation focus.
Exploration-oriented algorithms tend to over-sample arms with missing responses.
Simple mean imputation can mitigate missing data issues for exploration-focused algorithms.
Abstract
When comparing the performance of multi-armed bandit algorithms, the potential impact of missing data is often overlooked. In practice, it also affects their implementation where the simplest approach to overcome this is to continue to sample according to the original bandit algorithm, ignoring missing outcomes. We investigate the impact on performance of this approach to deal with missing data for several bandit algorithms through an extensive simulation study assuming the rewards are missing at random. We focus on two-armed bandit algorithms with binary outcomes in the context of patient allocation for clinical trials with relatively small sample sizes. However, our results apply to other applications of bandit algorithms where missing data is expected to occur. We assess the resulting operating characteristics, including the expected reward. Different probabilities of missingness in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
