Stochastic Momentum Tracking Push-Pull for Decentralized Optimization over Directed Graphs
Wenqi Fan, Yiwei Liao, Qing Xu, Bin Guo, and Songyi Dian

TL;DR
The paper introduces SMTPP, a decentralized optimization algorithm for directed graphs that reduces variance and oscillations, ensuring convergence and robust performance across various network connectivities.
Contribution
SMTPP tracks momentum instead of raw gradients, decoupling variance reduction from network topology and guaranteeing convergence on strongly connected directed graphs.
Findings
SMTPP achieves convergence rates close to centralized baselines.
The algorithm is robust to network sparsity and density.
SMTPP effectively dampens topology-induced oscillations.
Abstract
Decentralized optimization over directed networks is frequently challenged by asymmetric communication and the inherent high variance of stochastic gradients, which collectively cause severe oscillations and hinder algorithmic convergence. To address these challenges, we propose the Stochastic Momentum Tracking Push-Pull (SMTPP) algorithm, which tracks the momentum term rather than raw stochastic gradients within the Push-Pull architecture. This design successfully decouples the variance reduction capacity from the algebraic connectivity of the graph.Although the inherent topology mismatch of directed graphs precludes exact convergence under persistent stochastic noise, SMTPP rigorously compresses this unavoidable steady-state error floor into a minimal neighborhood determined by network connectivity and gradient variance. Furthermore, SMTPP guarantees convergence on any strongly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
