Fall of Empires: Breaking Byzantine-tolerant SGD by Inner Product   Manipulation

Cong Xie; Sanmi Koyejo; Indranil Gupta

arXiv:1903.03936·cs.LG·March 12, 2019·59 cites

Fall of Empires: Breaking Byzantine-tolerant SGD by Inner Product Manipulation

Cong Xie, Sanmi Koyejo, Indranil Gupta

PDF

Open Access 4 Repos

TL;DR

This paper demonstrates that existing Byzantine-tolerant aggregation methods for distributed SGD, such as coordinate-wise median and Krum, can be compromised through inner product manipulation attacks, both theoretically and empirically.

Contribution

It introduces new attack strategies that break two widely used Byzantine-tolerant methods, revealing vulnerabilities in current defenses.

Findings

01

Coordinate-wise median and Krum can be bypassed using inner product manipulation

02

Theoretical proofs confirm the effectiveness of the attacks

03

Empirical experiments validate the attack success in practice

Abstract

Recently, new defense techniques have been developed to tolerate Byzantine failures for distributed machine learning. The Byzantine model captures workers that behave arbitrarily, including malicious and compromised workers. In this paper, we break two prevailing Byzantine-tolerant techniques. Specifically we show robust aggregation methods for synchronous SGD -- coordinate-wise median and Krum -- can be broken using new attack strategies based on inner product manipulation. We prove our results theoretically, as well as show empirical validation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Age of Information Optimization

MethodsStochastic Gradient Descent