Simple Combinatorial Algorithms for Combinatorial Bandits: Corruptions and Approximations

Haike Xu; Jian Li

arXiv:2106.06712·cs.LG·June 15, 2021

Open Access

TL;DR

This paper introduces a simple, combinatorial algorithm for stochastic combinatorial semi-bandits with adversarial corruptions, achieving near-optimal regret bounds with lower complexity and weaker assumptions compared to prior methods.

Contribution

The paper presents a new combinatorial algorithm that improves regret bounds for corrupted semi-bandit problems, simplifying implementation and reducing computational complexity.

Findings

01

Achieves regret of $\tilde{O} (C + d^{2} K / Δ_{min})$ under adversarial corruptions.

02

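In the single-arm setting, improves the previous combinatorial bound of [Gupta et al., COLT2019], $\tilde{O} (K C + \sum_{Δ_{i} > 0} (1/ Δ_{i}))$, to $\tilde{O} (C + \sum_{Δ_{i} > 0} (1/ Δ_{i}))$.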

03

Requires weaker assumptions and has lower oracle complexity than existing methods.

Abstract

We consider the stochastic combinatorial semi-bandit problem with adversarial corruptions. We provide a simple combinatorial algorithm that can achieve a regret of $\tilde{O} (C + d^{2} K / Δ_{min})$, where $C$ is the total amount of corruptions, $d$ is the maximal number of arms one can play in each round, and $K$ is the number of arms. If one selects only one arm in each round, we achieve a regret of $\tilde{O} (C + \sum_{Δ_{i} > 0} (1/ Δ_{i}))$. Our algorithm is combinatorial and improves on the previous combinatorial algorithm by [Gupta et al., COLT2019] (their bound is $\tilde{O} (K C + \sum_{Δ_{i} > 0} (1/ Δ_{i}))$), and almost matches the best known bounds obtained by [Zimmert et al., ICML2019] and [Zimmert and Seldin, AISTATS2019] (up to a logarithmic factor). Note that the algorithms in [Zimmert et al., ICML2019] and [Zimmert and Seldin, AISTATS2019]…
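
To make the difference between the bounds quoted in the abstract concrete, the following is a minimal numerical sketch, not the paper's algorithm. It evaluates the three expressions with logarithmic factors and constants dropped. The function names, the gap values, and the corruption level are hypothetical and chosen only for illustration.

```python
def gap_term(gaps):
    """Sum of 1/Δ_i over arms with a strictly positive gap Δ_i."""
    return sum(1.0 / g for g in gaps if g > 0)


def additive_bound(C, gaps):
    """C + Σ 1/Δ_i: the single-arm bound stated in the abstract (log factors dropped)."""
    return C + gap_term(gaps)


def multiplicative_bound(C, gaps):
    """K·C + Σ 1/Δ_i: the earlier bound the abstract attributes to Gupta et al. (log factors dropped)."""
    K = len(gaps)  # number of arms
    return K * C + gap_term(gaps)


def semi_bandit_bound(C, d, K, delta_min):
    """C + d²K/Δ_min: the combinatorial semi-bandit bound from the abstract (log factors dropped)."""
    return C + d ** 2 * K / delta_min


# Hypothetical instance: 4 arms, the first is optimal (gap 0), total corruption C = 100.
gaps = [0.0, 0.5, 0.5, 0.25]
C = 100.0

print(additive_bound(C, gaps))        # 108.0
print(multiplicative_bound(C, gaps))  # 408.0
print(semi_bandit_bound(C, d=2, K=4, delta_min=0.25))  # 164.0
```

In this toy instance the corruption term dominates, so the factor of $K$ multiplying $C$ in the earlier bound accounts for almost all of the difference between the two single-arm expressions.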

Taxonomy

Topics: Advanced Bandit Algorithms Research · Optimization and Search Problems · Machine Learning and Algorithms