Byzantine-Resilient High-Dimensional Federated Learning
Deepesh Data, Suhas Diggavi

TL;DR
This paper introduces a Byzantine-resilient federated learning algorithm that uses robust mean estimation to filter malicious updates, providing convergence guarantees even with heterogeneous data and local SGD iterations.
Contribution
It presents the first Byzantine-resilient federated learning method with local iterations, combining robust mean estimation and novel matrix concentration analysis.
Findings
Achieves convergence under Byzantine attacks with local SGD.
Handles heterogeneous data without probabilistic assumptions.
Extends to full-batch gradient scenarios.
Abstract
We study stochastic gradient descent (SGD) with local iterations in the presence of malicious/Byzantine clients, motivated by the federated learning. The clients, instead of communicating with the central server in every iteration, maintain their local models, which they update by taking several SGD iterations based on their own datasets and then communicate the net update with the server, thereby achieving communication-efficiency. Furthermore, only a subset of clients communicate with the server, and this subset may be different at different synchronization times. The Byzantine clients may collaborate and send arbitrary vectors to the server to disrupt the learning process. To combat the adversary, we employ an efficient high-dimensional robust mean estimation algorithm from Steinhardt et al.~\cite[ITCS 2018]{Resilience_SCV18} at the server to filter-out corrupt vectors; and to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data · Random Matrices and Applications
MethodsStochastic Gradient Descent
