Asynchronous Parallel Empirical Variance Guided Algorithms for the   Thresholding Bandit Problem

Jie Zhong; Yijun Huang; Ji Liu

arXiv:1704.04567·stat.ML·July 11, 2017·6 cites

Asynchronous Parallel Empirical Variance Guided Algorithms for the Thresholding Bandit Problem

Jie Zhong, Yijun Huang, Ji Liu

PDF

Open Access

TL;DR

This paper introduces asynchronous parallel algorithms guided by empirical variance for the thresholding bandit problem, improving efficiency, applicability, and optimality over previous methods by allowing decision-making with unobserved rewards.

Contribution

It proposes novel asynchronous parallel algorithms that use empirical variance, are parameter-free, and do not require immediate reward observation, enhancing efficiency and practical applicability.

Findings

01

Algorithms significantly reduce round complexity.

02

Proven to be optimal under bounded high order moments.

03

Allow decision-making with unobserved rewards, improving practicality.

Abstract

This paper considers the multi-armed thresholding bandit problem -- identifying all arms whose expected rewards are above a predefined threshold via as few pulls (or rounds) as possible -- proposed by Locatelli et al. [2016] recently. Although the proposed algorithm in Locatelli et al. [2016] achieves the optimal round complexity in a certain sense, there still remain unsolved issues. This paper proposes an asynchronous parallel thresholding algorithm and its parameter-free version to improve the efficiency and the applicability. On one hand, the proposed two algorithms use the empirical variance to guide the pull decision at each round, and significantly improve the round complexity of the "optimal" algorithm when all arms have bounded high order moments. The proposed algorithms can be proven to be optimal. On the other hand, most bandit algorithms assume that the reward can be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Optimization and Search Problems · Machine Learning and Algorithms