Communication-Efficient Collaborative Regret Minimization in Multi-Armed   Bandits

Nikolai Karpov; Qin Zhang

arXiv:2301.11442·cs.LG·December 22, 2023

Communication-Efficient Collaborative Regret Minimization in Multi-Armed Bandits

Nikolai Karpov, Qin Zhang

PDF

Open Access

TL;DR

This paper explores the balance between communication frequency and regret minimization in multi-agent multi-armed bandit problems, providing new tradeoff insights for collaborative learning efficiency.

Contribution

It introduces the first tradeoff analysis between communication rounds and regret in collaborative multi-armed bandit settings.

Findings

01

Established fundamental tradeoffs between communication and regret

02

Provided theoretical bounds for collaborative learning efficiency

03

Analyzed the impact of communication frequency on regret minimization

Abstract

In this paper, we study the collaborative learning model, which concerns the tradeoff between parallelism and communication overhead in multi-agent multi-armed bandits. For regret minimization in multi-armed bandits, we present the first set of tradeoffs between the number of rounds of communication among the agents and the regret of the collaborative learning process.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Data Stream Mining Techniques · Distributed Sensor Networks and Detection Algorithms