Sparsity-Agnostic Linear Bandits with Adaptive Adversaries
Tianyuan Jin, Kyoungseok Jang, Nicol\`o Cesa-Bianchi

TL;DR
This paper introduces a new approach for stochastic linear bandits that adaptively handles unknown sparsity levels and adversarial action sets, achieving optimal regret bounds and improved empirical performance.
Contribution
It presents the first sparse regret bounds for unknown sparsity in adversarial settings and develops a novel randomized model selection technique.
Findings
Achieves sparse regret bounds with unknown sparsity S.
Recovers state-of-the-art bounds when S is known.
Improves empirical performance using a variant with Exp3.
Abstract
We study stochastic linear bandits where, in each round, the learner receives a set of actions (i.e., feature vectors), from which it chooses an element and obtains a stochastic reward. The expected reward is a fixed but unknown linear function of the chosen action. We study sparse regret bounds, that depend on the number of non-zero coefficients in the linear reward function. Previous works focused on the case where is known, or the action sets satisfy additional assumptions. In this work, we obtain the first sparse regret bounds that hold when is unknown and the action sets are adversarially generated. Our techniques combine online to confidence set conversions with a novel randomized model selection approach over a hierarchy of nested confidence sets. When is known, our analysis recovers state-of-the-art bounds for adversarial action sets. We also show that a variant…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Distributed Sensor Networks and Detection Algorithms
MethodsSparse Evolutionary Training
