Robust Decentralized Multi-armed Bandits: From Corruption-Resilience to Byzantine-Resilience

Zicheng Hu; Yuchen Wang; Cheng Chen

arXiv:2511.10344·cs.LG·November 14, 2025

Robust Decentralized Multi-armed Bandits: From Corruption-Resilience to Byzantine-Resilience

Zicheng Hu, Yuchen Wang, Cheng Chen

PDF

Open Access 1 Video

TL;DR

This paper introduces DeMABAR, a robust decentralized multi-armed bandit algorithm resilient to adversarial corruption and Byzantine attacks, ensuring low regret and robustness in multi-agent collaborative settings.

Contribution

The paper develops a novel algorithm, DeMABAR, that provides resilience against both reward corruption and Byzantine agent attacks in decentralized multi-armed bandits.

Findings

01

DeMABAR achieves regret bounds proportional to corruption budget.

02

The algorithm is effective against Byzantine agents with arbitrary behaviors.

03

Numerical experiments confirm robustness and practical effectiveness.

Abstract

Decentralized cooperative multi-agent multi-armed bandits (DeCMA2B) considers how multiple agents collaborate in a decentralized multi-armed bandit setting. Though this problem has been extensively studied in previous work, most existing methods remain susceptible to various adversarial attacks. In this paper, we first study DeCMA2B with adversarial corruption, where an adversary can corrupt reward observations of all agents with a limited corruption budget. We propose a robust algorithm, called DeMABAR, which ensures that each agent's individual regret suffers only an additive term proportional to the corruption budget. Then we consider a more realistic scenario where the adversary can only attack a small number of agents. Our theoretical analysis shows that the DeMABAR algorithm can also almost completely eliminate the influence of adversarial attacks and is inherently robust in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Robust Decentralized Multi-armed Bandits: From Corruption-Resilience to Byzantine-Resilience· underline

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Game Theory and Applications