Suboptimal Performance of the Bayes Optimal Algorithm in Frequentist Best Arm Identification
Junpei Komiyama

TL;DR
This paper shows that the Bayes optimal algorithm does not always achieve exponential decay in simple regret for best arm identification with normal rewards, challenging prior assumptions about Bayesian and frequentist equivalence.
Contribution
It reveals the suboptimal performance of the Bayes optimal algorithm in certain settings and introduces the concept of expected Bellman improvement for future research.
Findings
Bayes optimal algorithm does not guarantee exponential simple regret decay.
Contrasts with previous beliefs about Bayesian and frequentist asymptotic equivalence.
Introduces the concept of expected Bellman improvement for analyzing Bayesian algorithms.
Abstract
We consider the fixed-budget best arm identification problem with rewards following normal distributions. In this problem, the forecaster is given arms (or treatments) and time steps. The forecaster attempts to find the arm with the largest mean, via an adaptive experiment conducted using an algorithm. The algorithm's performance is evaluated by simple regret, reflecting the quality of the estimated best arm. While frequentist simple regret can decrease exponentially with respect to , Bayesian simple regret decreases polynomially. This paper demonstrates that the Bayes optimal algorithm, which minimizes the Bayesian simple regret, does not yield an exponential decrease in simple regret under certain parameter settings. This contrasts with the numerous findings that suggest the asymptotic equivalence of Bayesian and frequentist approaches in fixed sampling regimes. Although…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Auction Theory and Applications · Forecasting Techniques and Applications
