Variance-Aware Sparse Linear Bandits
Yan Dai, Ruosong Wang, Simon S. Du

TL;DR
This paper introduces the first variance-aware regret bounds for sparse linear bandits, which adapt to noise levels and interpolate between worst-case and benign settings, using a novel black-box framework.
Contribution
It provides a new variance-aware regret guarantee for sparse linear bandits and develops a general framework to adapt existing algorithms in a black-box manner.
Findings
Achieves a regret bound of rom or worst-case to benign regimes.
Develops a black-box framework for variance-aware sparse linear bandit algorithms.
Demonstrates the bounds with two recent algorithms, one handling unknown variance, the other more efficient.
Abstract
It is well-known that for sparse linear bandits, when ignoring the dependency on sparsity which is much smaller than the ambient dimension, the worst-case minimax regret is where is the ambient dimension and is the number of rounds. On the other hand, in the benign setting where there is no noise and the action set is the unit sphere, one can use divide-and-conquer to achieve regret, which is (nearly) independent of and . In this paper, we present the first variance-aware regret guarantee for sparse linear bandits: , where is the variance of the noise at the -th round. This bound naturally interpolates the regret bounds for the worst-case constant-variance regime (i.e., ) and the benign…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Smart Grid Energy Management
