Replicable Constrained Bandits

Matteo Bollini; Gianmarco Genalti; Francesco Emanuele Stradi; Matteo Castiglioni; Alberto Marchesi

arXiv:2602.14580·cs.LG·February 17, 2026

Replicable Constrained Bandits

Matteo Bollini, Gianmarco Genalti, Francesco Emanuele Stradi, Matteo Castiglioni, Alberto Marchesi

PDF

Open Access

TL;DR

This paper introduces the concept of replicability in constrained multi-armed bandit problems, providing algorithms that ensure consistent decision-making across runs while maintaining optimal regret and constraint satisfaction.

Contribution

It develops the first replicable algorithms for constrained MABs and a replicable UCB-like algorithm for unconstrained MABs, advancing reproducibility in online learning.

Findings

01

Replicable algorithms match non-replicable ones in regret and constraint violation.

02

Optimism in-the-face-of-uncertainty can be used to achieve replicability.

03

First replicable UCB-like algorithm for unconstrained MABs.

Abstract

Algorithmic \emph{replicability} has recently been introduced to address the need for reproducible experiments in machine learning. A \emph{replicable online learning} algorithm is one that takes the same sequence of decisions across different executions in the same environment, with high probability. We initiate the study of algorithmic replicability in \emph{constrained} MAB problems, where a learner interacts with an unknown stochastic environment for $T$ rounds, seeking not only to maximize reward but also to satisfy multiple constraints. Our main result is that replicability can be achieved in constrained MABs. Specifically, we design replicable algorithms whose regret and constraint violation match those of non-replicable ones in terms of $T$ . As a key step toward these guarantees, we develop the first replicable UCB-like algorithm for \emph{unconstrained} MABs, showing that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Stochastic Gradient Optimization Techniques