The Hidden Vulnerability of Distributed Learning in Byzantium

El Mahdi El Mhamdi; Rachid Guerraoui; S\'ebastien Rouault

arXiv:1802.07927·stat.ML·July 19, 2018·473 cites

The Hidden Vulnerability of Distributed Learning in Byzantium

El Mahdi El Mhamdi, Rachid Guerraoui, S\'ebastien Rouault

PDF

Open Access 1 Repo

TL;DR

This paper reveals that existing Byzantine-resilient distributed learning methods are vulnerable in high dimensions, allowing attackers to significantly influence model convergence, and proposes Bulyan to mitigate this issue effectively.

Contribution

The paper demonstrates the limitations of current Byzantine-resilient schemes in high-dimensional settings and introduces Bulyan, a new aggregation rule that substantially reduces attacker's leverage.

Findings

01

Existing schemes leave a poisoning margin growing with dimension

02

Bulyan reduces attacker leverage to a narrow bound

03

Bulyan achieves robust convergence comparable to non-Byzantine training

Abstract

While machine learning is going through an era of celebrated success, concerns have been raised about the vulnerability of its backbone: stochastic gradient descent (SGD). Recent approaches have been proposed to ensure the robustness of distributed SGD against adversarial (Byzantine) workers sending poisoned gradients during the training phase. Some of these approaches have been proven Byzantine-resilient: they ensure the convergence of SGD despite the presence of a minority of adversarial workers. We show in this paper that convergence is not enough. In high dimension $d ≫ 1$ , an adver\-sary can build on the loss function's non-convexity to make SGD converge to ineffective models. More precisely, we bring to light that existing Byzantine-resilient schemes leave a margin of poisoning of $Ω (f (d))$ , where $f (d)$ increases at least like $d$ . Based on this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vrt1shjwlkr/ndss21-model-poisoning
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Adversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data

MethodsStochastic Gradient Descent