Variance-Aware Sparse Linear Bandits

Yan Dai; Ruosong Wang; Simon S. Du

arXiv:2205.13450·cs.LG·February 8, 2023

Variance-Aware Sparse Linear Bandits

Yan Dai, Ruosong Wang, Simon S. Du

PDF

Open Access 1 Video

TL;DR

This paper introduces the first variance-aware regret bounds for sparse linear bandits, which adapt to noise levels and interpolate between worst-case and benign settings, using a novel black-box framework.

Contribution

It provides a new variance-aware regret guarantee for sparse linear bandits and develops a general framework to adapt existing algorithms in a black-box manner.

Findings

01

Achieves a regret bound of rom or worst-case to benign regimes.

02

Develops a black-box framework for variance-aware sparse linear bandit algorithms.

03

Demonstrates the bounds with two recent algorithms, one handling unknown variance, the other more efficient.

Abstract

It is well-known that for sparse linear bandits, when ignoring the dependency on sparsity which is much smaller than the ambient dimension, the worst-case minimax regret is $Θ (d T)$ where $d$ is the ambient dimension and $T$ is the number of rounds. On the other hand, in the benign setting where there is no noise and the action set is the unit sphere, one can use divide-and-conquer to achieve $O (1)$ regret, which is (nearly) independent of $d$ and $T$ . In this paper, we present the first variance-aware regret guarantee for sparse linear bandits: $O (d \sum_{t = 1}^{T} σ_{t}^{2} + 1)$ , where $σ_{t}^{2}$ is the variance of the noise at the $t$ -th round. This bound naturally interpolates the regret bounds for the worst-case constant-variance regime (i.e., $σ_{t} \equiv Ω (1)$ ) and the benign…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Variance-Aware Sparse Linear Bandits· slideslive

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Smart Grid Energy Management