Almost Minimax Optimal Best Arm Identification in Piecewise Stationary   Linear Bandits

Yunlong Hou; Vincent Y. F. Tan; Zixin Zhong

arXiv:2410.07638·cs.LG·October 11, 2024

Almost Minimax Optimal Best Arm Identification in Piecewise Stationary Linear Bandits

Yunlong Hou, Vincent Y. F. Tan, Zixin Zhong

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a novel piecewise stationary linear bandit model and an algorithm, PSεBAI+, that efficiently identifies near-optimal arms with minimal samples despite unknown environments and changepoints.

Contribution

The paper presents a new model for piecewise stationary linear bandits and an optimal algorithm that detects changepoints and aligns contexts for effective arm identification.

Findings

01

PSεBAI+ achieves near-optimal sample complexity.

02

The algorithm effectively detects changepoints and aligns contexts.

03

Numerical experiments confirm the efficiency of PSεBAI+.

Abstract

We propose a {\em novel} piecewise stationary linear bandit (PSLB) model, where the environment randomly samples a context from an unknown probability distribution at each changepoint, and the quality of an arm is measured by its return averaged over all contexts. The contexts and their distribution, as well as the changepoints are unknown to the agent. We design {\em Piecewise-Stationary $ε$ -Best Arm Identification $^{+}$ } (PS $ε$ BAI $^{+}$ ), an algorithm that is guaranteed to identify an $ε$ -optimal arm with probability $\geq 1 - δ$ and with a minimal number of samples. PS $ε$ BAI $^{+}$ consists of two subroutines, PS $ε$ BAI and {\sc Na\"ive $ε$ -BAI} (N $ε$ BAI), which are executed in parallel. PS $ε$ BAI actively detects changepoints and aligns contexts to facilitate the arm identification process. When…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Y-Hou/BAI-in-PSLB
noneOfficial

Videos

Almost Minimax Optimal Best Arm Identification in Piecewise Stationary Linear Bandits· slideslive

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Optimization and Search Problems · Machine Learning and Algorithms