Some performance considerations when using multi-armed bandit algorithms   in the presence of missing data

Xijin Chen; Kim May Lee; Sofia S. Villar; and David S. Robertson

arXiv:2205.03820·stat.ML·October 12, 2022

Some performance considerations when using multi-armed bandit algorithms in the presence of missing data

Xijin Chen, Kim May Lee, Sofia S. Villar, and David S. Robertson

PDF

TL;DR

This study examines how ignoring missing data affects the performance of multi-armed bandit algorithms in clinical trial simulations, revealing that the impact varies with the exploration-exploitation balance and can be mitigated by simple imputation.

Contribution

It provides an extensive simulation analysis of missing data effects on different bandit algorithms, highlighting the importance of handling missingness appropriately in practice.

Findings

01

Ignoring missing data impacts algorithms differently based on their exploration-exploitation focus.

02

Exploration-oriented algorithms tend to over-sample arms with missing responses.

03

Simple mean imputation can mitigate missing data issues for exploration-focused algorithms.

Abstract

When comparing the performance of multi-armed bandit algorithms, the potential impact of missing data is often overlooked. In practice, it also affects their implementation where the simplest approach to overcome this is to continue to sample according to the original bandit algorithm, ignoring missing outcomes. We investigate the impact on performance of this approach to deal with missing data for several bandit algorithms through an extensive simulation study assuming the rewards are missing at random. We focus on two-armed bandit algorithms with binary outcomes in the context of patient allocation for clinical trials with relatively small sample sizes. However, our results apply to other applications of bandit algorithms where missing data is expected to occur. We assess the resulting operating characteristics, including the expected reward. Different probabilities of missingness in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.