Thompson Sampling-like Algorithms for Stochastic Rising Bandits
Marco Fiandri, Alberto Maria Metelli, Francesco Trov\`o

TL;DR
This paper explores Thompson sampling-like algorithms for stochastic rising bandits, providing new regret analyses, lower bounds, and empirical comparisons in scenarios where rewards increase over time.
Contribution
It introduces adapted TS algorithms for SRRB, offers novel regret bounds, and compares their performance with existing methods through simulations.
Findings
TS-like algorithms can achieve sublinear regret under certain conditions
A new complexity index influences regret bounds in SRRB
Numerical results show competitive performance of TS-like algorithms
Abstract
Stochastic rising rested bandit (SRRB) is a setting where the arms' expected rewards increase as they are pulled. It models scenarios in which the performances of the different options grow as an effect of an underlying learning process (e.g., online model selection). Even if the bandit literature provides specifically crafted algorithms based on upper-confidence bounds for such a setting, no study about Thompson sampling TS-like algorithms has been performed so far. The strong regularity of the expected rewards in the SRRB setting suggests that specific instances may be tackled effectively using adapted and sliding-window TS approaches. This work provides novel regret analyses for such algorithms in SRRBs, highlighting the challenges and providing new technical tools of independent interest. Our results allow us to identify under which assumptions TS-like algorithms succeed in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Mobile Crowdsensing and Crowdsourcing · Optimization and Search Problems
MethodsSpatio-temporal stability analysis
