Simple Combinatorial Algorithms for Combinatorial Bandits: Corruptions and Approximations
Haike Xu, Jian Li

TL;DR
This paper introduces a simple, combinatorial algorithm for stochastic combinatorial semi-bandits with adversarial corruptions, achieving near-optimal regret bounds with lower complexity and weaker assumptions compared to prior methods.
Contribution
The paper presents a new combinatorial algorithm that improves regret bounds for corrupted semi-bandit problems, simplifying implementation and reducing computational complexity.
Findings
Achieves regret of C + d^2K/elta_{min} with adversarial corruptions.
Outperforms previous combinatorial algorithms in regret bounds.
Requires weaker assumptions and has lower oracle complexity than existing methods.
Abstract
We consider the stochastic combinatorial semi-bandit problem with adversarial corruptions. We provide a simple combinatorial algorithm that can achieve a regret of where is the total amount of corruptions, is the maximal number of arms one can play in each round, is the number of arms. If one selects only one arm in each round, we achieves a regret of . Our algorithm is combinatorial and improves on the previous combinatorial algorithm by [Gupta et al., COLT2019] (their bound is ), and almost matches the best known bounds obtained by [Zimmert et al., ICML2019] and [Zimmert and Seldin, AISTATS2019] (up to logarithmic factor). Note that the algorithms in [Zimmert et al., ICML2019] and [Zimmert and Seldin, AISTATS2019]…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Optimization and Search Problems · Machine Learning and Algorithms
