Snap-Shot Decentralized Stochastic Gradient Tracking Methods

Haishan Ye; Xiangyu Chang

arXiv:2212.05273·math.OC·December 13, 2022

Snap-Shot Decentralized Stochastic Gradient Tracking Methods

Haishan Ye, Xiangyu Chang

PDF

Open Access

TL;DR

This paper introduces two new decentralized stochastic gradient tracking algorithms utilizing snapshot techniques, which are more robust to network topology variations and improve convergence properties over existing methods.

Contribution

The paper proposes two novel decentralized stochastic gradient tracking algorithms that are less affected by network topology, enhancing robustness and convergence guarantees compared to prior DSGT methods.

Findings

01

The new algorithms have improved iteration complexity bounds.

02

They are more robust to network topology variations.

03

The convergence is less dependent on network parameters.

Abstract

In decentralized optimization, $m$ agents form a network and only communicate with their neighbors, which gives advantages in data ownership, privacy, and scalability. At the same time, decentralized stochastic gradient descent (\texttt{SGD}) methods, as popular decentralized algorithms for training large-scale machine learning models, have shown their superiority over centralized counterparts. Distributed stochastic gradient tracking~(\texttt{DSGT})~\citep{pu2021distributed} has been recognized as the popular and state-of-the-art decentralized \texttt{SGD} method due to its proper theoretical guarantees. However, the theoretical analysis of \dsgt~\citep{koloskova2021improved} shows that its iteration complexity is $\tilde{O} (\frac{σ ˉ ^{2}}{m μ ε} + \frac{L σ ˉ}{μ ( 1 - λ _{2} ( W ) ) ^{1/2} C _{W} ε})$ , where $W$ is a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Distributed Control Multi-Agent Systems · Age of Information Optimization