Improved Regret of Linear Ensemble Sampling

Harin Lee; Min-hwan Oh

arXiv:2411.03932·stat.ML·June 17, 2025

Improved Regret of Linear Ensemble Sampling

Harin Lee, Min-hwan Oh

PDF

Open Access 1 Video

TL;DR

This paper presents an improved theoretical regret bound for linear ensemble sampling in bandit problems, matching state-of-the-art results and revealing a key relationship with LinPHE, thus advancing the understanding of randomized exploration algorithms.

Contribution

It introduces a general regret analysis framework for linear bandit algorithms and establishes a connection between linear ensemble sampling and LinPHE.

Findings

01

Achieves a regret bound of ^{3/2}b7b7T for linear ensemble sampling.

02

Shows LinPHE is a special case of linear ensemble sampling with ensemble size T.

03

Provides theoretical insights aligning ensemble sampling with other exploration algorithms.

Abstract

In this work, we close the fundamental gap of theory and practice by providing an improved regret bound for linear ensemble sampling. We prove that with an ensemble size logarithmic in $T$ , linear ensemble sampling can achieve a frequentist regret bound of $\tilde{O} (d^{3/2} T)$ , matching state-of-the-art results for randomized linear bandit algorithms, where $d$ and $T$ are the dimension of the parameter and the time horizon respectively. Our approach introduces a general regret analysis framework for linear bandit algorithms. Additionally, we reveal a significant relationship between linear ensemble sampling and Linear Perturbed-History Exploration (LinPHE), showing that LinPHE is a special case of linear ensemble sampling when the ensemble size equals $T$ . This insight allows our analysis framework to derive a regret bound of $\tilde{O} (d^{3/2} T)$ for LinPHE, independent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Improved Regret of Linear Ensemble Sampling· slideslive

Taxonomy

TopicsFace and Expression Recognition