A single algorithm for both restless and rested rotting bandits

Julien Seznec; Pierre M\'enard; Alessandro Lazaric; Michal Valko

arXiv:2604.21432·stat.ML·April 24, 2026·6 cites

A single algorithm for both restless and rested rotting bandits

Julien Seznec, Pierre M\'enard, Alessandro Lazaric, Michal Valko

PDF

1 Datasets

TL;DR

This paper introduces RAW-UCB, a novel algorithm that effectively handles both rested and restless rotting bandit problems, achieving near-optimal regret without prior knowledge of the setting or non-stationarity type.

Contribution

The paper presents RAW-UCB, the first algorithm to adaptively perform well in both rested and restless rotting bandit scenarios without prior setting knowledge.

Findings

01

RAW-UCB achieves near-optimal regret in synthetic experiments.

02

The algorithm performs well on dataset-based experiments.

03

It outperforms existing algorithms in both settings.

Abstract

In many application domains (e.g., recommender systems, intelligent tutoring systems), the rewards associated to the actions tend to decrease over time. This decay is either caused by the actions executed in the past (e.g., a user may get bored when songs of the same genre are recommended over and over) or by an external factor (e.g., content becomes outdated). These two situations can be modeled as specific instances of the rested and restless bandit settings, where arms are rotting (i.e., their value decrease over time). These problems were thought to be significantly different, since Levine et al. (2017) showed that state-of-the-art algorithms for restless bandit perform poorly in the rested rotting setting. In this paper, we introduce a novel algorithm, Rotting Adaptive Window UCB (RAW-UCB), that achieves near-optimal regret in both rotting rested and restless bandit, without any…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

misovalko/my-research-papers
dataset· 103 dl
103 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.