Byzantine-Robust Decentralized Stochastic Optimization with Stochastic Gradient Noise-Independent Learning Error
Jie Peng, Weiyu Li, Qing Ling

TL;DR
This paper introduces two variance reduction methods, BRAVO-SAGA and BRAVO-LSVRG, for Byzantine-robust decentralized stochastic optimization that achieve linear convergence and noise-independent learning errors, improving robustness against adversarial agents.
Contribution
The paper proposes novel variance reduction algorithms, BRAVO-SAGA and BRAVO-LSVRG, that attain both linear convergence and minimal learning error unaffected by stochastic gradient noise in Byzantine-robust decentralized optimization.
Findings
Methods achieve linear convergence speeds.
Learning errors are independent of stochastic gradient noise.
Effective under various Byzantine attack scenarios.
Abstract
This paper studies Byzantine-robust stochastic optimization over a decentralized network, where every agent periodically communicates with its neighbors to exchange local models, and then updates its own local model by stochastic gradient descent (SGD). The performance of such a method is affected by an unknown number of Byzantine agents, which conduct adversarially during the optimization process. To the best of our knowledge, there is no existing work that simultaneously achieves a linear convergence speed and a small learning error. We observe that the learning error is largely dependent on the intrinsic stochastic gradient noise. Motivated by this observation, we introduce two variance reduction methods, stochastic average gradient algorithm (SAGA) and loopless stochastic variance-reduced gradient (LSVRG), to Byzantine-robust decentralized stochastic optimization for eliminating the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Distributed Control Multi-Agent Systems · Privacy-Preserving Technologies in Data
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
