Distributed Statistical Machine Learning in Adversarial Settings: Byzantine Gradient Descent

Yudong Chen; Lili Su; Jiaming Xu

arXiv:1705.05491 · cs.DC · October 24, 2017 · 143 cites

TL;DR

This paper introduces a Byzantine-resilient distributed gradient descent algorithm, motivated by Federated Learning, that tolerates working machines which are compromised and behave arbitrarily, while achieving near-optimal estimation accuracy with provable convergence guarantees.

Contribution

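It proposes a robust gradient aggregation rule: instead of simply averaging the gradients reported by the working machines, the parameter server takes the geometric median of means of those gradients, which improves fault tolerance in distributed learning systems.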
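The aggregation idea can be illustrated with a minimal sketch. It assumes the reported gradients are split into `k` batches, each batch is averaged, and the geometric median of the batch means is approximated with Weiszfeld iterations. The function names, the batch count `k`, the iteration limits, and the choice of Weiszfeld's method are illustrative assumptions, not details taken from this page.

```python
import numpy as np

def geometric_median(points, n_iter=200, tol=1e-9):
    """Approximate the geometric median of the rows of `points`
    with Weiszfeld iterations (an assumed solver, for illustration)."""
    points = np.asarray(points, dtype=float)
    z = points.mean(axis=0)  # start from the plain average
    for _ in range(n_iter):
        dist = np.linalg.norm(points - z, axis=1)
        dist = np.maximum(dist, 1e-12)  # guard against division by zero
        w = 1.0 / dist
        z_new = (w[:, None] * points).sum(axis=0) / w.sum()
        if np.linalg.norm(z_new - z) < tol:
            return z_new
        z = z_new
    return z

def geometric_median_of_means(gradients, k):
    """Split the reported gradients into k batches, average each batch,
    and return the geometric median of the k batch means."""
    gradients = np.asarray(gradients, dtype=float)
    batches = np.array_split(gradients, k)
    batch_means = np.stack([b.mean(axis=0) for b in batches])
    return geometric_median(batch_means)

# Hypothetical example: 12 workers, 2 of which report arbitrary values.
rng = np.random.default_rng(0)
true_gradient = np.array([1.0, -2.0, 0.5])
reported = true_gradient + 0.1 * rng.normal(size=(12, 3))
reported[:2] = 1e3  # Byzantine reports (made-up attack)

robust = geometric_median_of_means(reported, k=6)
naive = reported.mean(axis=0)
print("robust error:", np.linalg.norm(robust - true_gradient))
print("naive error: ", np.linalg.norm(naive - true_gradient))
```

In this toy run the two corrupted reports land in a single batch, so only one of the six batch means is an outlier and the geometric median stays close to the honest cluster, while the plain average is pulled far away.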

Findings

01

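Tolerates $q$ Byzantine failures as long as $2(1+\epsilon)q \le m$ for an arbitrarily small but fixed constant $\epsilon > 0$, i.e. just under half of the $m$ working machines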

02

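Converges in $O(\log N)$ rounds with estimation error on the order of $\max\{\sqrt{dq/N}, \sqrt{d/N}\}$, at most a factor of $\sqrt{q}$ above the minimax-optimal rate $\sqrt{d/N}$ of the failure-free setting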

03

Proves that the aggregated gradient converges uniformly to the true gradient function, despite the arbitrary dependence that Byzantine failures create across iterations

Abstract

We consider the problem of distributed statistical machine learning in adversarial settings, where some unknown and time-varying subset of working machines may be compromised and behave arbitrarily to prevent an accurate model from being learned. This setting captures the potential adversarial attacks faced by Federated Learning -- a modern machine learning paradigm that is proposed by Google researchers and has been intensively studied for ensuring user privacy. Formally, we focus on a distributed system consisting of a parameter server and $m$ working machines. Each working machine keeps $N / m$ data samples, where $N$ is the total number of samples. The goal is to collectively learn the underlying true model parameter of dimension $d$. In classical batch gradient descent methods, the gradients reported to the server by the working machines are aggregated via simple averaging, which…
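The setup in the abstract (a parameter server, $m$ working machines holding $N/m$ samples each, gradients combined by simple averaging) can be sketched on synthetic data. The sketch below assumes a linear regression model, a fixed step size, and a made-up attack in which one worker always reports a constant vector; all names and numbers are hypothetical. It also illustrates a general property of averaging: a single arbitrary report can shift the mean, and hence the learned parameter, by a large amount.

```python
import numpy as np

rng = np.random.default_rng(0)
m, n_per, d = 10, 50, 3  # m workers, N/m samples each, dimension d
theta_true = np.array([1.0, -2.0, 0.5])

# Synthetic linear regression data, one slice per worker (assumed model).
X = rng.normal(size=(m, n_per, d))
y = X @ theta_true + 0.1 * rng.normal(size=(m, n_per))

def local_gradient(theta, Xj, yj):
    """Gradient of half the mean squared error on one worker's data."""
    return Xj.T @ (Xj @ theta - yj) / len(yj)

def run_averaging_gd(byzantine=None, rounds=100, step=0.1):
    """Batch gradient descent where the server averages reported gradients.
    If `byzantine` is a worker index, that worker reports a constant vector."""
    theta = np.zeros(d)
    for _ in range(rounds):
        grads = np.stack([local_gradient(theta, X[j], y[j]) for j in range(m)])
        if byzantine is not None:
            grads[byzantine] = 100.0  # arbitrary report (made-up attack)
        theta = theta - step * grads.mean(axis=0)
    return theta

clean = run_averaging_gd()
attacked = run_averaging_gd(byzantine=0)
print("error without failures:", np.linalg.norm(clean - theta_true))
print("error with one Byzantine worker:", np.linalg.norm(attacked - theta_true))
```

Replacing `grads.mean(axis=0)` with a robust aggregator is the kind of change the paper's contribution targets.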

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning

Methods: Linear Regression