Byzantine-Resilient High-Dimensional Federated Learning

Deepesh Data; Suhas Diggavi

arXiv:2006.13041·stat.ML·August 18, 2020·1 cites

Byzantine-Resilient High-Dimensional Federated Learning

Deepesh Data, Suhas Diggavi

PDF

Open Access

TL;DR

This paper introduces a Byzantine-resilient federated learning algorithm that uses robust mean estimation to filter malicious updates, providing convergence guarantees even with heterogeneous data and local SGD iterations.

Contribution

It presents the first Byzantine-resilient federated learning method with local iterations, combining robust mean estimation and novel matrix concentration analysis.

Findings

01

Achieves convergence under Byzantine attacks with local SGD.

02

Handles heterogeneous data without probabilistic assumptions.

03

Extends to full-batch gradient scenarios.

Abstract

We study stochastic gradient descent (SGD) with local iterations in the presence of malicious/Byzantine clients, motivated by the federated learning. The clients, instead of communicating with the central server in every iteration, maintain their local models, which they update by taking several SGD iterations based on their own datasets and then communicate the net update with the server, thereby achieving communication-efficiency. Furthermore, only a subset of clients communicate with the server, and this subset may be different at different synchronization times. The Byzantine clients may collaborate and send arbitrary vectors to the server to disrupt the learning process. To combat the adversary, we employ an efficient high-dimensional robust mean estimation algorithm from Steinhardt et al.~\cite[ITCS 2018]{Resilience_SCV18} at the server to filter-out corrupt vectors; and to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data · Random Matrices and Applications

MethodsStochastic Gradient Descent