Federated Multi-Armed Bandits

Chengshuai Shi; Cong Shen

arXiv:2101.12204·cs.LG·March 4, 2021·6 cites

Federated Multi-Armed Bandits

Chengshuai Shi, Cong Shen

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces federated multi-armed bandits (FMAB), proposing a general framework and two models, with algorithms that achieve near-optimal regret while managing communication costs, validated through experiments.

Contribution

It presents a novel FMAB framework, develops Fed2-UCB for the approximate model, and analyzes the exact model, advancing federated bandit algorithms with theoretical guarantees.

Findings

01

Fed2-UCB achieves O(log(T)) regret in the approximate model.

02

Careful client admission reduces communication costs.

03

Order-optimal regret is achievable independently of client number.

Abstract

Federated multi-armed bandits (FMAB) is a new bandit paradigm that parallels the federated learning (FL) framework in supervised learning. It is inspired by practical applications in cognitive radio and recommender systems, and enjoys features that are analogous to FL. This paper proposes a general framework of FMAB and then studies two specific federated bandit models. We first study the approximate model where the heterogeneous local models are random realizations of the global model from an unknown distribution. This model introduces a new uncertainty of client sampling, as the global model may not be reliably learned even if the finite local models are perfectly known. Furthermore, this uncertainty cannot be quantified a priori without knowledge of the suboptimality gap. We solve the approximate model by proposing Federated Double UCB (Fed2-UCB), which constructs a novel "double…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ShenGroup/FMAB
noneOfficial

Videos

Federated Multi-Armed Bandits· underline

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Machine Learning and Algorithms