Distributed Multi-Agent Bandits Over Erd\H{o}s-R\'enyi Random Networks
Jingyuan Liu, Hao Qiu, Lin Yang, Mengfan Xu

TL;DR
This paper introduces a distributed algorithm for multi-agent multi-armed bandits over Erdős-Rényi random networks, achieving near-optimal regret bounds that account for network connectivity and randomness.
Contribution
It proposes a novel fully distributed algorithm combining arm elimination and gossip strategies, with theoretical regret bounds that incorporate network properties and randomness.
Findings
Regret scales as O(log T), matching centralized bounds.
Network connectivity and link probability significantly affect regret.
Numerical results confirm theoretical predictions and algorithm superiority.
Abstract
We study the distributed multi-agent multi-armed bandit problem with heterogeneous rewards over random communication graphs. Uniquely, at each time step agents communicate over a time-varying random graph generated by applying the Erd\H{o}s-R\'enyi model to a fixed connected base graph (for classical Erd\H{o}s-R\'enyi graphs, is a complete graph), where each potential edge in is randomly and independently present with the link probability . Notably, the resulting random graph is not necessarily connected at each time step. Each agent's arm rewards follow time-invariant distributions, and the reward distribution for the same arm may differ across agents. The goal is to minimize the cumulative expected regret relative to the global mean reward of each arm, defined as the average of that arm's mean rewards across all agents. To this end, we propose a fully…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
