Byzantine-Robust Decentralized Stochastic Optimization with Stochastic   Gradient Noise-Independent Learning Error

Jie Peng; Weiyu Li; Qing Ling

arXiv:2308.05292·cs.LG·August 11, 2023

Byzantine-Robust Decentralized Stochastic Optimization with Stochastic Gradient Noise-Independent Learning Error

Jie Peng, Weiyu Li, Qing Ling

PDF

Open Access

TL;DR

This paper introduces two variance reduction methods, BRAVO-SAGA and BRAVO-LSVRG, for Byzantine-robust decentralized stochastic optimization that achieve linear convergence and noise-independent learning errors, improving robustness against adversarial agents.

Contribution

The paper proposes novel variance reduction algorithms, BRAVO-SAGA and BRAVO-LSVRG, that attain both linear convergence and minimal learning error unaffected by stochastic gradient noise in Byzantine-robust decentralized optimization.

Findings

01

Methods achieve linear convergence speeds.

02

Learning errors are independent of stochastic gradient noise.

03

Effective under various Byzantine attack scenarios.

Abstract

This paper studies Byzantine-robust stochastic optimization over a decentralized network, where every agent periodically communicates with its neighbors to exchange local models, and then updates its own local model by stochastic gradient descent (SGD). The performance of such a method is affected by an unknown number of Byzantine agents, which conduct adversarially during the optimization process. To the best of our knowledge, there is no existing work that simultaneously achieves a linear convergence speed and a small learning error. We observe that the learning error is largely dependent on the intrinsic stochastic gradient noise. Motivated by this observation, we introduce two variance reduction methods, stochastic average gradient algorithm (SAGA) and loopless stochastic variance-reduced gradient (LSVRG), to Byzantine-robust decentralized stochastic optimization for eliminating the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Distributed Control Multi-Agent Systems · Privacy-Preserving Technologies in Data

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings