Stochastic bandits robust to adversarial corruptions

Thodoris Lykouris; Vahab Mirrokni; Renato Paes Leme

arXiv:1803.09353·cs.LG·March 28, 2018·5 cites

Stochastic bandits robust to adversarial corruptions

Thodoris Lykouris, Vahab Mirrokni, Renato Paes Leme

PDF

Open Access

TL;DR

This paper introduces a new stochastic bandit model resilient to adversarial corruptions, proposing an algorithm that maintains near-optimal performance in stochastic settings and degrades gracefully with adversarial interference.

Contribution

It presents a simple, adaptive bandit algorithm that is robust to adversarial corruptions and provides theoretical guarantees on its performance degradation.

Findings

01

Algorithm's performance degrades linearly with corruption level C

02

Retains near-optimal guarantees in purely stochastic settings

03

Lower bound shows linear degradation is necessary for optimal stochastic performance

Abstract

We introduce a new model of stochastic bandits with adversarial corruptions which aims to capture settings where most of the input follows a stochastic pattern but some fraction of it can be adversarially changed to trick the algorithm, e.g., click fraud, fake reviews and email spam. The goal of this model is to encourage the design of bandit algorithms that (i) work well in mixed adversarial and stochastic models, and (ii) whose performance deteriorates gracefully as we move from fully stochastic to fully adversarial models. In our model, the rewards for all arms are initially drawn from a distribution and are then altered by an adaptive adversary. We provide a simple algorithm whose performance gracefully degrades with the total corruption the adversary injected in the data, measured by the sum across rounds of the biggest alteration the adversary made in the data in that round;…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Reinforcement Learning in Robotics