PopArt: Efficient Sparse Regression and Experimental Design for Optimal Sparse Linear Bandits

Kyoungseok Jang; Chicheng Zhang; Kwang-Sung Jun

arXiv:2210.15345·stat.ML·November 21, 2023

TL;DR

PopArt introduces a computationally efficient sparse linear estimation method with tighter recovery guarantees, leading to improved regret bounds in sparse linear bandits and a matching lower bound in data-poor regimes.

Contribution

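The paper presents PopArt, a new sparse linear estimator with a tighter $\ell_1$ recovery guarantee than Lasso in many problems, and develops sparse linear bandit algorithms with improved regret bounds based on a convex experimental design criterion motivated by that guarantee.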

Findings

01

PopArt achieves a tighter $\ell_1$ recovery guarantee than Lasso in many problems.

02

The proposed algorithms improve on the regret upper bounds of the prior state of the art (Hao et al., 2020), especially with respect to the geometry of the action set.

03

A matching lower bound is established in the data-poor regime.

Abstract

In sparse linear bandits, a learning agent sequentially selects an action and receives reward feedback, and the reward function depends linearly on a few coordinates of the covariates of the actions. This has applications in many real-world sequential decision-making problems. In this paper, we propose a simple and computationally efficient sparse linear estimation method called PopArt that enjoys a tighter $\ell_1$ recovery guarantee compared to Lasso (Tibshirani, 1996) in many problems. Our bound naturally motivates an experimental design criterion that is convex and thus computationally efficient to solve. Based on our novel estimator and design criterion, we derive sparse linear bandit algorithms that enjoy improved regret upper bounds upon the state of the art (Hao et al., 2020), especially w.r.t. the geometry of the given action set. Finally, we prove a matching lower bound for…
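To make the setting in the abstract concrete, here is a minimal sketch of a sparse linear bandit environment in plain Python. It is not the PopArt estimator or any algorithm from the paper; it only illustrates the reward model, where the expected reward is linear in the action's covariates and depends on just a few coordinates. The dimension `d`, sparsity `s`, Gaussian noise, random action set, and the uniformly random policy are all assumptions chosen for illustration.

```python
import random


def make_sparse_theta(d, s, rng):
    """Return a d-dimensional parameter with exactly s nonzero coordinates."""
    theta = [0.0] * d
    for i in rng.sample(range(d), s):
        theta[i] = rng.choice([-1.0, 1.0])
    return theta


def expected_reward(action, theta):
    """Linear reward model: inner product of action covariates and theta."""
    return sum(a * t for a, t in zip(action, theta))


def pull(action, theta, rng, noise_sd=0.1):
    """Noisy reward feedback the agent observes after choosing an action."""
    return expected_reward(action, theta) + rng.gauss(0.0, noise_sd)


rng = random.Random(0)
d, s = 20, 3                      # ambient dimension and sparsity (assumed)
theta = make_sparse_theta(d, s, rng)
support = [i for i, t in enumerate(theta) if t != 0.0]

# A finite action set of random covariate vectors (assumed for illustration).
actions = [[rng.uniform(-1.0, 1.0) for _ in range(d)] for _ in range(50)]
best = max(actions, key=lambda a: expected_reward(a, theta))

# Cumulative regret of a uniformly random policy over T rounds.
T = 200
regret = 0.0
for _ in range(T):
    a = rng.choice(actions)
    observed = pull(a, theta, rng)  # feedback a learning agent would use
    regret += expected_reward(best, theta) - expected_reward(a, theta)
```

A learning algorithm would replace the random choice of `a` with a rule that uses the observed rewards to estimate `theta`; the regret accumulated here is the quantity that the bounds discussed in the abstract control.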

Code & Models

Repositories

jajajang/sparse
Official

Videos

PopArt: Efficient Sparse Regression and Experimental Design for Optimal Sparse Linear Bandits · SlidesLive

Taxonomy

Topics: Advanced Bandit Algorithms Research · Machine Learning and Algorithms · Distributed Sensor Networks and Detection Algorithms