Beyond the Best: Estimating Distribution Functionals in Infinite-Armed   Bandits

Yifei Wang; Tavor Baharav; Yanjun Han; Jiantao Jiao; David Tse

arXiv:2211.01743·cs.LG·November 4, 2022·1 cites

Beyond the Best: Estimating Distribution Functionals in Infinite-Armed Bandits

Yifei Wang, Tavor Baharav, Yanjun Han, Jiantao Jiao, David Tse

PDF

Open Access

TL;DR

This paper develops unified algorithms for estimating various distribution functionals in infinite-armed bandits, demonstrating optimal sample complexities and revealing a threshold phenomenon in median estimation.

Contribution

It introduces a general framework for estimating distribution functionals beyond the maximum, with optimal algorithms and lower bounds for both offline and online settings.

Findings

01

Online estimation reduces sample complexity for certain functionals.

02

Optimal algorithms match lower bounds across different functionals.

03

A thresholding phenomenon is identified in median estimation related to Gaussian convolutions.

Abstract

In the infinite-armed bandit problem, each arm's average reward is sampled from an unknown distribution, and each arm can be sampled further to obtain noisy estimates of the average reward of that arm. Prior work focuses on identifying the best arm, i.e., estimating the maximum of the average reward distribution. We consider a general class of distribution functionals beyond the maximum, and propose unified meta algorithms for both the offline and online settings, achieving optimal sample complexities. We show that online estimation, where the learner can sequentially choose whether to sample a new or existing arm, offers no advantage over the offline setting for estimating the mean functional, but significantly reduces the sample complexity for other functionals such as the median, maximum, and trimmed mean. The matching lower bounds utilize several different Wasserstein distances. For…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Mobile Crowdsensing and Crowdsourcing