Sparsity-Agnostic Linear Bandits with Adaptive Adversaries

Tianyuan Jin; Kyoungseok Jang; Nicol\`o Cesa-Bianchi

arXiv:2406.01192·cs.LG·June 4, 2024

Sparsity-Agnostic Linear Bandits with Adaptive Adversaries

Tianyuan Jin, Kyoungseok Jang, Nicol\`o Cesa-Bianchi

PDF

Open Access 1 Video

TL;DR

This paper introduces a new approach for stochastic linear bandits that adaptively handles unknown sparsity levels and adversarial action sets, achieving optimal regret bounds and improved empirical performance.

Contribution

It presents the first sparse regret bounds for unknown sparsity in adversarial settings and develops a novel randomized model selection technique.

Findings

01

Achieves sparse regret bounds with unknown sparsity S.

02

Recovers state-of-the-art bounds when S is known.

03

Improves empirical performance using a variant with Exp3.

Abstract

We study stochastic linear bandits where, in each round, the learner receives a set of actions (i.e., feature vectors), from which it chooses an element and obtains a stochastic reward. The expected reward is a fixed but unknown linear function of the chosen action. We study sparse regret bounds, that depend on the number $S$ of non-zero coefficients in the linear reward function. Previous works focused on the case where $S$ is known, or the action sets satisfy additional assumptions. In this work, we obtain the first sparse regret bounds that hold when $S$ is unknown and the action sets are adversarially generated. Our techniques combine online to confidence set conversions with a novel randomized model selection approach over a hierarchy of nested confidence sets. When $S$ is known, our analysis recovers state-of-the-art bounds for adversarial action sets. We also show that a variant…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Sparsity-Agnostic Linear Bandits with Adaptive Adversaries· slideslive

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Distributed Sensor Networks and Detection Algorithms

MethodsSparse Evolutionary Training