Best Arm Identification for Stochastic Rising Bandits

Marco Mussi; Alessandro Montenegro; Francesco Trov\'o; Marcello; Restelli; Alberto Maria Metelli

arXiv:2302.07510·cs.LG·May 29, 2024

Best Arm Identification for Stochastic Rising Bandits

Marco Mussi, Alessandro Montenegro, Francesco Trov\'o, Marcello, Restelli, Alberto Maria Metelli

PDF

Open Access 1 Repo

TL;DR

This paper studies the fixed-budget best arm identification problem in stochastic rising bandits, proposing algorithms with theoretical guarantees and validating their performance through experiments.

Contribution

It introduces two algorithms for fixed-budget BAI in SRBs, providing theoretical guarantees and matching lower bounds, a novel focus in this setting.

Findings

01

Algorithms achieve high probability of correct identification with large budgets.

02

Proved lower bounds match the performance of R-SR algorithm.

03

Validated effectiveness through experiments on synthetic and real data.

Abstract

Stochastic Rising Bandits (SRBs) model sequential decision-making problems in which the expected reward of the available options increases every time they are selected. This setting captures a wide range of scenarios in which the available options are learning entities whose performance improves (in expectation) over time (e.g., online best model selection). While previous works addressed the regret minimization problem, this paper focuses on the fixed-budget Best Arm Identification (BAI) problem for SRBs. In this scenario, given a fixed budget of rounds, we are asked to provide a recommendation about the best option at the end of the identification process. We propose two algorithms to tackle the above-mentioned setting, namely R-UCBE, which resorts to a UCB-like approach, and R-SR, which employs a successive reject procedure. Then, we prove that, with a sufficiently large budget, they…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

montenegroalessandro/bestarmidsrb
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Data Stream Mining Techniques · Reinforcement Learning in Robotics