Robust Multi-Agent Bandits Over Undirected Graphs

Daniel Vial; Sanjay Shakkottai; R. Srikant

arXiv:2203.00076·cs.LG·January 30, 2023

TL;DR

This paper studies multi-agent bandit algorithms over networks that contain malicious agents. It shows that the state-of-the-art algorithm can suffer nearly linear regret on the undirected line graph, and it proposes a new algorithm whose regret bounds depend on each agent's local malicious neighbors.

Contribution

The paper introduces a new algorithm for multi-agent bandits over arbitrary connected graphs, with regret bounds depending on local malicious neighbors, extending prior results beyond complete graphs.

Findings

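1. On the undirected line graph, the state-of-the-art algorithm can leave honest agents with (nearly) linear regret until time is doubly exponential in the number of arms and the number of honest agents.

2. The proposed algorithm achieves regret that depends on each agent's local malicious neighbors.

3. Regret bounds are generalized to any connected undirected graph.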

Abstract

We consider a multi-agent multi-armed bandit setting in which $n$ honest agents collaborate over a network to minimize regret but $m$ malicious agents can disrupt learning arbitrarily. Assuming the network is the complete graph, existing algorithms incur $O((m + K/n) \log(T) / \Delta)$ regret in this setting, where $K$ is the number of arms and $\Delta$ is the arm gap. For $m \ll K$, this improves over the single-agent baseline regret of $O(K \log(T) / \Delta)$. In this work, we show the situation is murkier beyond the case of a complete graph. In particular, we prove that if the state-of-the-art algorithm is used on the undirected line graph, honest agents can suffer (nearly) linear regret until time is doubly exponential in $K$ and $n$. In light of this negative result, we propose a new algorithm for which the $i$-th agent has regret $O((d_{\text{mal}}(i) + K/n) \log(T) / \Delta)$ …
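To get a feel for how the three regret expressions in the abstract compare, the following is a minimal Python sketch that evaluates only their leading-order terms. It is an illustration, not the paper's algorithm: the hidden constants in the $O(\cdot)$ bounds are set to 1, the function name `regret_scale` and all numeric values are hypothetical, and $d_{mal}(i)$ is read as the number of malicious neighbors of agent $i$, as the summary lines above suggest.

```python
import math


def regret_scale(per_agent_term: float, horizon: int, gap: float) -> float:
    """Leading-order scale per_agent_term * log(T) / gap.

    The constant hidden by the O(.) notation is assumed to be 1,
    so only ratios between the returned values are meaningful.
    """
    return per_agent_term * math.log(horizon) / gap


# Hypothetical problem sizes, chosen only for illustration.
K, n, m = 100, 20, 5   # arms, honest agents, malicious agents
T, gap = 10**6, 0.1    # time horizon and arm gap
d_mal_i = 1            # assumed number of malicious neighbors of agent i

# Single-agent baseline: K log(T) / gap
single_agent = regret_scale(K, T, gap)
# Existing algorithms on the complete graph: (m + K/n) log(T) / gap
complete_graph = regret_scale(m + K / n, T, gap)
# Proposed algorithm, agent i: (d_mal(i) + K/n) log(T) / gap
local_bound = regret_scale(d_mal_i + K / n, T, gap)

print(f"single-agent baseline : {single_agent:.1f}")
print(f"complete-graph bound  : {complete_graph:.1f}")
print(f"local bound (agent i) : {local_bound:.1f}")
```

With these made-up values, the per-agent term falls from $K = 100$ for the single-agent baseline to $m + K/n = 10$ on the complete graph, and to $d_{mal}(i) + K/n = 6$ for an agent with one malicious neighbor. The shared $\log(T)/\Delta$ factor cancels in any ratio between the three quantities.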

Taxonomy

Topics: Advanced Bandit Algorithms Research · Optimization and Search Problems · Misinformation and Its Impacts