Structured Linear Contextual Bandits: A Sharp and Geometric Smoothed Analysis
Vidyashankar Sivakumar, Zhiwei Steven Wu, Arindam Banerjee

TL;DR
This paper introduces a smoothed setting for structured linear contextual bandits with Gaussian noise perturbations, proposing simple greedy algorithms and providing unified regret bounds that leverage geometric properties of the parameter structure.
Contribution
It presents a unified analysis of greedy algorithms for structured linear bandits under Gaussian smoothing, with sharper regret bounds and geometric insights.
Findings
Greedy algorithms perform well in the smoothed setting.
Regret bounds depend on Gaussian widths related to structure.
Sharper bounds achieved for unstructured parameters.
Abstract
Bandit learning algorithms typically involve the balance of exploration and exploitation. However, in many practical applications, worst-case scenarios needing systematic exploration are seldom encountered. In this work, we consider a smoothed setting for structured linear contextual bandits where the adversarial contexts are perturbed by Gaussian noise and the unknown parameter has structure, e.g., sparsity, group sparsity, low rank, etc. We propose simple greedy algorithms for both the single- and multi-parameter (i.e., different parameter for each context) settings and provide a unified regret analysis for with any assumed structure. The regret bounds are expressed in terms of geometric quantities such as Gaussian widths associated with the structure of . We also obtain sharper regret bounds compared to earlier work for the unstructured …
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Sparse and Compressive Sensing Techniques · Machine Learning and Algorithms
