Approximate Top-$m$ Arm Identification with Heterogeneous Reward Variances

Ruida Zhou; Chao Tian

arXiv:2204.05245 · cs.LG · April 12, 2022


TL;DR

This paper investigates the sample complexity of identifying the top-$m$ arms in a multi-armed bandit setting with heterogeneous reward variances, and provides tight bounds that account for variance heterogeneity through an entropy-like measure.

Contribution

It introduces a new complexity characterization for top-$m$ arm identification considering reward variance heterogeneity, with matching upper and lower bounds.

Findings

01. Derived the worst-case sample complexity involving variance heterogeneity and entropy.

02. Proposed a divide-and-conquer algorithm achieving the upper bound.

03. Established a matching lower bound through dual formulation analysis.

Abstract

We study the effect of reward variance heterogeneity in the approximate top-$m$ arm identification setting. In this setting, the reward for the $i$-th arm follows a $\sigma_{i}^{2}$-sub-Gaussian distribution, and the agent needs to incorporate this knowledge to minimize the expected number of arm pulls to identify $m$ arms with the largest means within error $\epsilon$ out of the $n$ arms, with probability at least $1 - \delta$. We show that the worst-case sample complexity of this problem is $\Theta\left( \sum_{i=1}^{n} \frac{\sigma_{i}^{2}}{\epsilon^{2}} \ln \frac{1}{\delta} + \sum_{i \in G^{m}} \frac{\sigma_{i}^{2}}{\epsilon^{2}} \ln(m) + \sum_{j \in G^{l}} \frac{\sigma_{j}^{2}}{\epsilon^{2}} \mathrm{Ent}(\sigma_{G^{r}}^{2}) \right)$, where $G^{m}, G^{l}, G^{r}$ are certain specific subsets of the overall arm set $\{1, 2, \dots, n\}$, and $\mathrm{Ent}(\cdot)$ is an entropy-like function which measures the…
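To make the setting concrete, the following is a minimal Python sketch, not the authors' algorithm. It evaluates only the first term of the bound, $\sum_{i=1}^{n} \frac{\sigma_i^2}{\epsilon^2} \ln \frac{1}{\delta}$, with the constants hidden by $\Theta$ dropped, and compares it with what the same term would be if every arm were assigned the largest variance. The function names, the example variances, and the success criterion in `is_eps_top_m` (every chosen arm's mean is within $\epsilon$ of the $m$-th largest mean) are assumptions made for illustration; the abstract does not spell out the exact criterion, and the sets $G^{m}, G^{l}, G^{r}$ and $\mathrm{Ent}(\cdot)$ are not defined here, so the other two terms are left out.

```python
import math


def confidence_term(sigma_sq, eps, delta):
    """First term of the bound with constants dropped:
    sum_i sigma_i^2 / eps^2 * ln(1 / delta)."""
    return sum(s / eps**2 for s in sigma_sq) * math.log(1.0 / delta)


def is_eps_top_m(means, chosen, m, eps):
    """Assumed success criterion (hypothetical helper): the m chosen arms
    all have means within eps of the m-th largest mean."""
    if len(set(chosen)) != m:
        return False
    threshold = sorted(means, reverse=True)[m - 1] - eps
    return all(means[i] >= threshold for i in chosen)


# Illustrative variance proxies: two noisy arms and two nearly noiseless arms.
sigma_sq = [1.0, 1.0, 0.01, 0.01]
eps, delta = 0.1, 0.05

hetero = confidence_term(sigma_sq, eps, delta)
# Same term if each arm is treated as having the worst-case variance.
homo = confidence_term([max(sigma_sq)] * len(sigma_sq), eps, delta)

print(f"heterogeneous term: {hetero:.1f}")
print(f"homogeneous (max-variance) term: {homo:.1f}")
```

In this example the heterogeneous term is about half the max-variance term, since the two low-variance arms contribute almost nothing to the sum. This only illustrates why knowing the per-arm $\sigma_i^2$ can reduce the number of pulls; it says nothing about the remaining two terms of the bound.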


Taxonomy

Topics: Machine Learning and Algorithms · Statistical Methods and Inference · Markov Chains and Monte Carlo Methods