Cooperative Multi-Agent Bandits with Heavy Tails

Abhimanyu Dubey; Alex Pentland

arXiv:2008.06244·cs.LG·August 17, 2020·5 cites

Cooperative Multi-Agent Bandits with Heavy Tails

Abhimanyu Dubey, Alex Pentland

PDF

Open Access 1 Video

TL;DR

This paper introduces MP-UCB, a robust decentralized algorithm for cooperative multi-agent heavy-tailed bandit problems, achieving optimal regret bounds and surpassing existing methods.

Contribution

It presents the first robust, decentralized algorithm with proven optimal regret bounds for heavy-tailed cooperative bandits, along with new lower bounds for the problem.

Findings

01

MP-UCB outperforms existing algorithms in heavy-tailed settings.

02

The paper establishes the first lower bounds for cooperative heavy-tailed bandits.

03

Proves optimal regret bounds for MP-UCB across multiple problem scenarios.

Abstract

We study the heavy-tailed stochastic bandit problem in the cooperative multi-agent setting, where a group of agents interact with a common bandit problem, while communicating on a network with delays. Existing algorithms for the stochastic bandit in this setting utilize confidence intervals arising from an averaging-based communication protocol known as~\textit{running consensus}, that does not lend itself to robust estimation for heavy-tailed settings. We propose \textsc{MP-UCB}, a decentralized multi-agent algorithm for the cooperative stochastic bandit that incorporates robust estimation with a message-passing protocol. We prove optimal regret bounds for \textsc{MP-UCB} for several problem settings, and also demonstrate its superiority to existing methods. Furthermore, we establish the first lower bounds for the cooperative bandit problem, in addition to providing efficient…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Cooperative Multi-Agent Bandits with Heavy Tails· slideslive

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Distributed Sensor Networks and Detection Algorithms · Optimization and Search Problems