A Parallel algorithm for $\mathcal{X}$-Armed bandits
Cheng Chen, Shuang Liu, Zhihua Zhang, Wu-Jun Li

TL;DR
This paper introduces a distributed algorithm for $ ext{X}$-armed bandits that leverages multiple players to efficiently find the global maximum of an unknown function, significantly reducing computation time and communication rounds.
Contribution
The paper presents a novel anytime distributed $ ext{X}$-armed bandit algorithm with a unique search strategy suitable for large-scale, multi-player scenarios, achieving linear speedup and logarithmic communication complexity.
Findings
Algorithm is $m$ times faster than single-player methods.
Communication rounds are only logarithmic in $mn$.
Numerical results confirm effective multi-player utilization.
Abstract
The target of -armed bandit problem is to find the global maximum of an unknown stochastic function , given a finite budget of evaluations. Recently, -armed bandits have been widely used in many situations. Many of these applications need to deal with large-scale data sets. To deal with these large-scale data sets, we study a distributed setting of -armed bandits, where players collaborate to find the maximum of the unknown function. We develop a novel anytime distributed -armed bandit algorithm. Compared with prior work on -armed bandits, our algorithm uses a quite different searching strategy so as to fit distributed learning scenarios. Our theoretical analysis shows that our distributed algorithm is times faster than the classical single-player algorithm. Moreover, the number of communication rounds of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Advanced Bandit Algorithms Research · Optimization and Search Problems
