Algorithms for multi-armed bandit problems
Volodymyr Kuleshov, Doina Precup

TL;DR
This paper empirically evaluates popular multi-armed bandit algorithms, revealing that simple heuristics often outperform complex ones, and demonstrates their potential benefits in clinical trial settings for improved patient outcomes.
Contribution
The study provides a comprehensive empirical comparison of bandit algorithms and explores their application in clinical trials, highlighting practical advantages over traditional methods.
Findings
Simple heuristics outperform theoretically sound algorithms in most settings.
Algorithm performance varies with problem parameters, identifying optimal and poor settings.
Bandit algorithms could increase patient treatment success and reduce adverse effects in clinical trials.
Abstract
Although many algorithms for the multi-armed bandit problem are well-understood theoretically, empirical confirmation of their effectiveness is generally scarce. This paper presents a thorough empirical study of the most popular multi-armed bandit algorithms. Three important observations can be made from our results. Firstly, simple heuristics such as epsilon-greedy and Boltzmann exploration outperform theoretically sound algorithms on most settings by a significant margin. Secondly, the performance of most algorithms varies dramatically with the parameters of the bandit problem. Our study identifies for each algorithm the settings where it performs well, and the settings where it performs poorly. Thirdly, the algorithms' performance relative each to other is affected only by the number of bandit arms and the variance of the rewards. This finding may guide the design of subsequent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Optimization and Search Problems
