Communication-Efficient Collaborative Best Arm Identification

Nikolai Karpov; Qin Zhang

arXiv:2208.09029·cs.LG·November 29, 2022·1 cites

TL;DR

This paper studies top-m arm identification in a multi-agent bandit setting, where agents collaborate to achieve the maximum speedup over single-agent learning at the minimum communication cost. It gives both algorithmic and impossibility results, supported by experiments.

Contribution

It introduces new algorithms and impossibility results for collaborative top-m arm identification with minimized communication costs.

Findings

01. Algorithms achieve significant speedup over single-agent methods.

02. Communication-efficient algorithms outperform existing approaches.

03. Experimental results validate theoretical advantages.

Abstract

We investigate top-$m$ arm identification, a basic problem in bandit theory, in a multi-agent learning model in which agents collaborate to learn an objective function. We are interested in designing collaborative learning algorithms that achieve maximum speedup (compared to single-agent learning algorithms) using minimum communication cost, as communication is frequently the bottleneck in multi-agent learning. We give both algorithmic and impossibility results, and conduct a set of experiments to demonstrate the effectiveness of our algorithms.
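To make the setting concrete, the following is a minimal Python sketch of a naive collaborative baseline. It is not the authors' algorithm. It assumes Bernoulli rewards, a fixed number of rounds, and a coordinator that merges per-arm statistics once per round. The function name `collaborative_top_m` and all of its parameters are hypothetical and chosen only for illustration. The sketch shows the two quantities the abstract trades off: the number of pulls each agent makes (which shrinks as more agents share the work) and the number of messages exchanged.

```python
import random


def collaborative_top_m(means, m, num_agents, rounds, pulls_per_round, seed=0):
    """Naive baseline (illustrative only): agents sample every arm locally,
    then send their per-arm reward sums to a coordinator once per round."""
    rng = random.Random(seed)
    n = len(means)
    total_reward = [0] * n
    total_pulls = [0] * n
    messages = 0

    for _ in range(rounds):
        # Local phase: each agent pulls every arm without communicating.
        local_sums = []
        for _agent in range(num_agents):
            sums = [
                sum(1 for _ in range(pulls_per_round) if rng.random() < mu)
                for mu in means
            ]
            local_sums.append(sums)

        # Communication phase: one message per agent per round.
        for sums in local_sums:
            messages += 1
            for arm in range(n):
                total_reward[arm] += sums[arm]
                total_pulls[arm] += pulls_per_round

    # The coordinator ranks arms by pooled empirical mean and keeps the top m.
    estimates = [total_reward[a] / total_pulls[a] for a in range(n)]
    top = sorted(range(n), key=lambda a: estimates[a], reverse=True)[:m]
    return sorted(top), messages


# Hypothetical instance: 5 arms, identify the best 2 with 4 agents.
top_arms, messages = collaborative_top_m(
    means=[0.9, 0.8, 0.3, 0.2, 0.1], m=2,
    num_agents=4, rounds=3, pulls_per_round=200,
)
```

In this baseline each arm receives `num_agents * rounds * pulls_per_round` pulls in total while every agent spends only `rounds * pulls_per_round` pulls per arm, so the per-agent running time drops by a factor of `num_agents` at the price of `num_agents * rounds` messages. The paper's question is how small the communication can be made while keeping that kind of speedup.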

Taxonomy

Topics: Data Stream Mining Techniques · Auction Theory and Applications · Advanced Bandit Algorithms Research