Bandit Theory meets Compressed Sensing for high dimensional Stochastic Linear Bandit
Alexandra Carpentier (INRIA Lille - Nord Europe), R\'emi Munos (INRIA, Lille - Nord Europe)

TL;DR
This paper introduces a novel approach combining compressed sensing and bandit theory to efficiently handle high-dimensional stochastic linear bandit problems with sparse parameters, achieving regret bounds independent of the ambient dimension.
Contribution
It proposes new algorithms that leverage sparsity and compressed sensing techniques to attain regret bounds proportional to the sparsity level, improving over traditional methods in high-dimensional settings.
Findings
Achieves regret bounds of O(S√n) for sparse parameters
Demonstrates the effectiveness of combining compressed sensing with bandit algorithms
Provides theoretical guarantees for high-dimensional linear bandit problems
Abstract
We consider a linear stochastic bandit problem where the dimension of the unknown parameter is larger than the sampling budget . In such cases, it is in general impossible to derive sub-linear regret bounds since usual linear bandit algorithms have a regret in . In this paper we assume that is sparse, i.e. has at most non-zero components, and that the space of arms is the unit ball for the norm. We combine ideas from Compressed Sensing and Bandit Theory and derive algorithms with regret bounds in .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Sparse and Compressive Sensing Techniques · Stochastic processes and financial applications
