Asynchronous Parallel Empirical Variance Guided Algorithms for the Thresholding Bandit Problem
Jie Zhong, Yijun Huang, Ji Liu

TL;DR
This paper introduces asynchronous parallel algorithms guided by empirical variance for the thresholding bandit problem, improving efficiency, applicability, and optimality over previous methods by allowing decision-making with unobserved rewards.
Contribution
It proposes novel asynchronous parallel algorithms that use empirical variance, are parameter-free, and do not require immediate reward observation, enhancing efficiency and practical applicability.
Findings
Algorithms significantly reduce round complexity.
Proven to be optimal under bounded high order moments.
Allow decision-making with unobserved rewards, improving practicality.
Abstract
This paper considers the multi-armed thresholding bandit problem -- identifying all arms whose expected rewards are above a predefined threshold via as few pulls (or rounds) as possible -- proposed by Locatelli et al. [2016] recently. Although the proposed algorithm in Locatelli et al. [2016] achieves the optimal round complexity in a certain sense, there still remain unsolved issues. This paper proposes an asynchronous parallel thresholding algorithm and its parameter-free version to improve the efficiency and the applicability. On one hand, the proposed two algorithms use the empirical variance to guide the pull decision at each round, and significantly improve the round complexity of the "optimal" algorithm when all arms have bounded high order moments. The proposed algorithms can be proven to be optimal. On the other hand, most bandit algorithms assume that the reward can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Optimization and Search Problems · Machine Learning and Algorithms
