Byzantine-Robust Distributed Learning: Towards Optimal Statistical Rates

Dong Yin; Yudong Chen; Kannan Ramchandran; Peter Bartlett

arXiv:1803.01498·cs.LG·February 26, 2021·543 cites

TL;DR

This paper develops distributed learning algorithms that are provably robust to Byzantine failures, analyzes their statistical error rates for strongly convex, non-strongly convex, and smooth non-convex losses, and proposes a more communication-efficient variant.

Contribution

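It gives a sharp analysis of two provably robust distributed gradient descent algorithms, one based on the median and one on the trimmed mean, shows that both achieve order-optimal statistical error rates for strongly convex losses, and adds a communication-efficient median-based method.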

Findings

01

Both the median-based and the trimmed-mean-based gradient descent algorithms achieve order-optimal statistical error rates for strongly convex losses.

02

A median-based method attains the optimal error rate for strongly convex quadratic losses using only one communication round.

03

Statistical error rates are proved for strongly convex, non-strongly convex, and smooth non-convex population losses, with quadratic losses covered in the one-round setting.
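The one-round idea in finding 02 can be illustrated as follows. This is a sketch under stated assumptions rather than the paper's exact procedure: it assumes each worker solves its own local least-squares problem and the master takes the coordinate-wise median of the reported solutions. The function name, the synthetic data, and the Byzantine behavior are hypothetical.

```python
import numpy as np

def one_round_median(X_parts, y_parts, byzantine=()):
    """Each worker sends one vector (its local least-squares solution);
    the master returns the coordinate-wise median of what it receives."""
    reports = []
    for i, (X, y) in enumerate(zip(X_parts, y_parts)):
        if i in byzantine:
            # Hypothetical attack: report an arbitrary large vector.
            reports.append(np.full(X.shape[1], 1e3))
        else:
            w_local, *_ = np.linalg.lstsq(X, y, rcond=None)
            reports.append(w_local)
    return np.median(np.array(reports), axis=0)

# Synthetic linear-regression data split across 9 workers (all assumed).
rng = np.random.default_rng(0)
w_true = np.array([1.0, -2.0, 0.5])
X_parts = [rng.normal(size=(100, 3)) for _ in range(9)]
y_parts = [X @ w_true + 0.1 * rng.normal(size=100) for X in X_parts]

w_hat = one_round_median(X_parts, y_parts, byzantine={0, 1})
one_round_error = np.linalg.norm(w_hat - w_true)
```

Only a single vector per worker crosses the network, which is the communication saving. The cost is that each local solution uses only that worker's data.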

Abstract

In large-scale distributed learning, security issues have become increasingly important. Particularly in a decentralized environment, some computing units may behave abnormally, or even exhibit Byzantine failures -- arbitrary and potentially adversarial behavior. In this paper, we develop distributed learning algorithms that are provably robust against such failures, with a focus on achieving optimal statistical performance. A main result of this work is a sharp analysis of two robust distributed gradient descent algorithms based on median and trimmed mean operations, respectively. We prove statistical error rates for three kinds of population loss functions: strongly convex, non-strongly convex, and smooth non-convex. In particular, these algorithms are shown to achieve order-optimal statistical error rates for strongly convex losses. To achieve better communication efficiency, we…
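A robust distributed gradient descent loop of the kind the abstract describes might look like the sketch below. It is an illustration only: the squared loss, step size, number of steps, synthetic data, and the constant-vector attack are assumptions made for the example, and the coordinate-wise median stands in for either robust aggregation rule.

```python
import numpy as np

def distributed_gd(X_parts, y_parts, byzantine, aggregate, steps=200, lr=0.1):
    """Master-worker gradient descent on a squared loss.
    Honest workers send their local gradient; Byzantine workers send an
    arbitrary vector. `aggregate` maps an (m, d) array to a (d,) update."""
    d = X_parts[0].shape[1]
    w = np.zeros(d)
    for _ in range(steps):
        grads = []
        for i, (X, y) in enumerate(zip(X_parts, y_parts)):
            if i in byzantine:
                grads.append(np.full(d, 100.0))  # hypothetical attack
            else:
                grads.append(X.T @ (X @ w - y) / len(y))
        w = w - lr * aggregate(np.array(grads))
    return w

# Synthetic setup (all assumed): 10 workers, 2 of them Byzantine.
rng = np.random.default_rng(0)
w_true = np.array([1.0, -2.0, 0.5])
X_parts = [rng.normal(size=(200, 3)) for _ in range(10)]
y_parts = [X @ w_true + 0.1 * rng.normal(size=200) for X in X_parts]
byzantine = {0, 1}

w_median = distributed_gd(X_parts, y_parts, byzantine,
                          lambda g: np.median(g, axis=0))
w_mean = distributed_gd(X_parts, y_parts, byzantine,
                        lambda g: np.mean(g, axis=0))

err_median = np.linalg.norm(w_median - w_true)
err_mean = np.linalg.norm(w_mean - w_true)
```

Comparing `err_median` with `err_mean` shows the point of the robust aggregate: averaging lets two corrupted workers shift the solution by a large amount, while the median-based update stays close to `w_true` as long as corrupted workers are a minority.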

Taxonomy

Topics: Distributed Sensor Networks and Detection Algorithms · Bayesian Modeling and Causal Inference · Machine Learning and Algorithms