Stability and Generalization of Push-Sum Based Decentralized Optimization over Directed Graphs
Yifei Liang, Yan Sun, Xiaochun Cao, Li Shen

TL;DR
This paper develops a stability framework for Push-Sum-based decentralized optimization over directed graphs, analyzing how topology and imbalance affect convergence and generalization in both convex and non-convex settings.
Contribution
It introduces a unified stability analysis capturing the effects of directed topology, imbalance, and mixing speed on decentralized learning performance.
Findings
Establishes finite-iteration stability and generalization bounds for Push-Sum algorithms.
Characterizes the impact of imbalance and spectral gap on convergence rates.
Provides conditions when Push-Sum correction is necessary versus standard decentralized SGD.
Abstract
Push-Sum-based decentralized learning enables optimization over directed communication networks, where information exchange may be asymmetric. While convergence properties of such methods are well understood, their finite-iteration stability and generalization behavior remain unclear due to structural bias induced by column-stochastic mixing and asymmetric error propagation. In this work, we develop a unified uniform-stability framework for the Stochastic Gradient Push (SGP) algorithm that captures the effect of directed topology. A key technical ingredient is an imbalance-aware consistency bound for Push-Sum, which controls consensus deviation through two quantities: the stationary distribution imbalance parameter and the spectral gap governing mixing speed. This decomposition enables us to disentangle statistical effects from topology-induced bias. We establish…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed Control Multi-Agent Systems · Stochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data
