Distributed Statistical Machine Learning in Adversarial Settings: Byzantine Gradient Descent
Yudong Chen, Lili Su, Jiaming Xu

TL;DR
This paper introduces a Byzantine-resilient distributed gradient descent algorithm for federated learning, capable of tolerating malicious machine failures and achieving near-optimal estimation accuracy with provable convergence guarantees.
Contribution
It proposes a robust gradient aggregation method based on the geometric median of means, improving fault tolerance in distributed learning systems.
Findings
Tolerates up to (m-1)/2 Byzantine failures
Achieves convergence in O(log N) rounds with near-optimal error rate
Provides theoretical guarantees for gradient convergence despite adversarial failures
Abstract
We consider the problem of distributed statistical machine learning in adversarial settings, where some unknown and time-varying subset of working machines may be compromised and behave arbitrarily to prevent an accurate model from being learned. This setting captures the potential adversarial attacks faced by Federated Learning -- a modern machine learning paradigm that is proposed by Google researchers and has been intensively studied for ensuring user privacy. Formally, we focus on a distributed system consisting of a parameter server and working machines. Each working machine keeps data samples, where is the total number of samples. The goal is to collectively learn the underlying true model parameter of dimension . In classical batch gradient descent methods, the gradients reported to the server by the working machines are aggregated via simple averaging, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning
MethodsLinear Regression
