Structured Linear Contextual Bandits: A Sharp and Geometric Smoothed   Analysis

Vidyashankar Sivakumar; Zhiwei Steven Wu; Arindam Banerjee

arXiv:2002.11332·cs.LG·February 27, 2020·6 cites

Structured Linear Contextual Bandits: A Sharp and Geometric Smoothed Analysis

Vidyashankar Sivakumar, Zhiwei Steven Wu, Arindam Banerjee

PDF

Open Access 1 Video

TL;DR

This paper introduces a smoothed setting for structured linear contextual bandits with Gaussian noise perturbations, proposing simple greedy algorithms and providing unified regret bounds that leverage geometric properties of the parameter structure.

Contribution

It presents a unified analysis of greedy algorithms for structured linear bandits under Gaussian smoothing, with sharper regret bounds and geometric insights.

Findings

01

Greedy algorithms perform well in the smoothed setting.

02

Regret bounds depend on Gaussian widths related to structure.

03

Sharper bounds achieved for unstructured parameters.

Abstract

Bandit learning algorithms typically involve the balance of exploration and exploitation. However, in many practical applications, worst-case scenarios needing systematic exploration are seldom encountered. In this work, we consider a smoothed setting for structured linear contextual bandits where the adversarial contexts are perturbed by Gaussian noise and the unknown parameter $θ^{*}$ has structure, e.g., sparsity, group sparsity, low rank, etc. We propose simple greedy algorithms for both the single- and multi-parameter (i.e., different parameter for each context) settings and provide a unified regret analysis for $θ^{*}$ with any assumed structure. The regret bounds are expressed in terms of geometric quantities such as Gaussian widths associated with the structure of $θ^{*}$ . We also obtain sharper regret bounds compared to earlier work for the unstructured $θ^{*}$ …

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Structured Linear Contextual Bandits: A Sharp and Geometric Smoothed Analysis· slideslive

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Sparse and Compressive Sensing Techniques · Machine Learning and Algorithms