Rotting Infinitely Many-armed Bandits

Jung-hun Kim; Milan Vojnovic; Se-Young Yun

arXiv:2201.12975·cs.LG·December 19, 2023

Rotting Infinitely Many-armed Bandits

Jung-hun Kim, Milan Vojnovic, Se-Young Yun

PDF

Open Access 1 Repo

TL;DR

This paper studies the infinitely many-armed bandit problem with rotting rewards, establishing tight regret bounds and proposing algorithms that adapt to unknown rotting rates, advancing understanding of non-stationary bandit challenges.

Contribution

It provides tight regret bounds for rotting rewards in infinite-armed bandits and introduces algorithms that adapt to unknown rotting rates.

Findings

01

Matching lower and upper regret bounds up to poly-log factors.

02

Algorithms with known and unknown rotting rates achieve near-optimal regret.

03

Adaptive algorithms effectively handle non-stationary reward decay.

Abstract

We consider the infinitely many-armed bandit problem with rotting rewards, where the mean reward of an arm decreases at each pull of the arm according to an arbitrary trend with maximum rotting rate $ϱ = o (1)$ . We show that this learning problem has an $Ω (max {ϱ^{1/3} T, T})$ worst-case regret lower bound where $T$ is the horizon time. We show that a matching upper bound $\tilde{O} (max {ϱ^{1/3} T, T})$ , up to a poly-logarithmic factor, can be achieved by an algorithm that uses a UCB index for each arm and a threshold value to decide whether to continue pulling an arm or remove the arm from further consideration, when the algorithm knows the value of the maximum rotting rate $ϱ$ . We also show that an $\tilde{O} (max {ϱ^{1/3} T, T^{3/4}})$ regret upper bound can be achieved by an algorithm that does not know the value of $ϱ$ , by using an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

junghunkim7786/rotting_infinite_armed_bandits
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Optimization and Search Problems