Federated Multi-Armed Bandits
Chengshuai Shi, Cong Shen

TL;DR
This paper introduces federated multi-armed bandits (FMAB), proposing a general framework and two models, with algorithms that achieve near-optimal regret while managing communication costs, validated through experiments.
Contribution
It presents a novel FMAB framework, develops Fed2-UCB for the approximate model, and analyzes the exact model, advancing federated bandit algorithms with theoretical guarantees.
Findings
Fed2-UCB achieves O(log(T)) regret in the approximate model.
Careful client admission reduces communication costs.
Order-optimal regret is achievable independently of client number.
Abstract
Federated multi-armed bandits (FMAB) is a new bandit paradigm that parallels the federated learning (FL) framework in supervised learning. It is inspired by practical applications in cognitive radio and recommender systems, and enjoys features that are analogous to FL. This paper proposes a general framework of FMAB and then studies two specific federated bandit models. We first study the approximate model where the heterogeneous local models are random realizations of the global model from an unknown distribution. This model introduces a new uncertainty of client sampling, as the global model may not be reliably learned even if the finite local models are perfectly known. Furthermore, this uncertainty cannot be quantified a priori without knowledge of the suboptimality gap. We solve the approximate model by proposing Federated Double UCB (Fed2-UCB), which constructs a novel "double…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Machine Learning and Algorithms
