Thompson Sampling-like Algorithms for Stochastic Rising Bandits

Marco Fiandri; Alberto Maria Metelli; Francesco Trov\`o

arXiv:2505.12092·stat.ML·May 21, 2025

Thompson Sampling-like Algorithms for Stochastic Rising Bandits

Marco Fiandri, Alberto Maria Metelli, Francesco Trov\`o

PDF

Open Access

TL;DR

This paper explores Thompson sampling-like algorithms for stochastic rising bandits, providing new regret analyses, lower bounds, and empirical comparisons in scenarios where rewards increase over time.

Contribution

It introduces adapted TS algorithms for SRRB, offers novel regret bounds, and compares their performance with existing methods through simulations.

Findings

01

TS-like algorithms can achieve sublinear regret under certain conditions

02

A new complexity index influences regret bounds in SRRB

03

Numerical results show competitive performance of TS-like algorithms

Abstract

Stochastic rising rested bandit (SRRB) is a setting where the arms' expected rewards increase as they are pulled. It models scenarios in which the performances of the different options grow as an effect of an underlying learning process (e.g., online model selection). Even if the bandit literature provides specifically crafted algorithms based on upper-confidence bounds for such a setting, no study about Thompson sampling TS-like algorithms has been performed so far. The strong regularity of the expected rewards in the SRRB setting suggests that specific instances may be tackled effectively using adapted and sliding-window TS approaches. This work provides novel regret analyses for such algorithms in SRRBs, highlighting the challenges and providing new technical tools of independent interest. Our results allow us to identify under which assumptions TS-like algorithms succeed in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Mobile Crowdsensing and Crowdsourcing · Optimization and Search Problems

MethodsSpatio-temporal stability analysis