Snap-Shot Decentralized Stochastic Gradient Tracking Methods
Haishan Ye, Xiangyu Chang

TL;DR
This paper introduces two new decentralized stochastic gradient tracking algorithms utilizing snapshot techniques, which are more robust to network topology variations and improve convergence properties over existing methods.
Contribution
The paper proposes two novel decentralized stochastic gradient tracking algorithms that are less affected by network topology, enhancing robustness and convergence guarantees compared to prior DSGT methods.
Findings
The new algorithms have improved iteration complexity bounds.
They are more robust to network topology variations.
The convergence is less dependent on network parameters.
Abstract
In decentralized optimization, agents form a network and only communicate with their neighbors, which gives advantages in data ownership, privacy, and scalability. At the same time, decentralized stochastic gradient descent (\texttt{SGD}) methods, as popular decentralized algorithms for training large-scale machine learning models, have shown their superiority over centralized counterparts. Distributed stochastic gradient tracking~(\texttt{DSGT})~\citep{pu2021distributed} has been recognized as the popular and state-of-the-art decentralized \texttt{SGD} method due to its proper theoretical guarantees. However, the theoretical analysis of \dsgt~\citep{koloskova2021improved} shows that its iteration complexity is , where is a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Distributed Control Multi-Agent Systems · Age of Information Optimization
