CRIMED: Lower and Upper Bounds on Regret for Bandits with Unbounded Stochastic Corruption
Shubhada Agrawal, Timoth\'ee Mathieu, Debabrota Basu, Odalric-Ambrym, Maillard

TL;DR
This paper introduces CRIMED, an optimal algorithm for multi-armed bandits with unbounded corruptions, providing tight regret bounds and robust median concentration results even with high corruption levels.
Contribution
We establish regret lower bounds and propose CRIMED, an asymptotically optimal algorithm that handles unbounded corruptions in bandit problems, with finite-sample guarantees and median concentration results.
Findings
CRIMED achieves the exact lower bound on regret for Gaussian bandits.
CRIMED effectively handles corruption levels up to 50%.
A new median concentration inequality under arbitrary corruptions is developed.
Abstract
We investigate the regret-minimisation problem in a multi-armed bandit setting with arbitrary corruptions. Similar to the classical setup, the agent receives rewards generated independently from the distribution of the arm chosen at each time. However, these rewards are not directly observed. Instead, with a fixed , the agent observes a sample from the chosen arm's distribution with probability , or from an arbitrary corruption distribution with probability . Importantly, we impose no assumptions on these corruption distributions, which can be unbounded. In this setting, accommodating potentially unbounded corruptions, we establish a problem-dependent lower bound on regret for a given family of arm distributions. We introduce CRIMED, an asymptotically-optimal algorithm that achieves the exact lower bound on regret for bandits…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Auction Theory and Applications · Optimization and Search Problems
