On the Minimax Regret for Linear Bandits in a wide variety of Action   Spaces

Debangshu Banerjee; Aditya Gopalan

arXiv:2301.03597·cs.LG·January 11, 2023

On the Minimax Regret for Linear Bandits in a wide variety of Action Spaces

Debangshu Banerjee, Aditya Gopalan

PDF

Open Access

TL;DR

This paper establishes an optimal lower bound on the minimax regret for linear bandit problems across diverse convex action spaces, addressing a long-standing open problem in the field.

Contribution

It provides the first comprehensive lower bound characterization for linear bandits in various convex action spaces, advancing theoretical understanding.

Findings

01

Optimal regret lower bound derived for convex action spaces

02

Addresses open problem in linear bandit theory

03

Enhances understanding of regret minimization in complex action spaces

Abstract

As noted in the works of \cite{lattimore2020bandit}, it has been mentioned that it is an open problem to characterize the minimax regret of linear bandits in a wide variety of action spaces. In this article we present an optimal regret lower bound for a wide class of convex action spaces.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research