Stability and Generalization of Push-Sum Based Decentralized Optimization over Directed Graphs

Yifei Liang; Yan Sun; Xiaochun Cao; Li Shen

arXiv:2602.20567·cs.LG·February 25, 2026

Stability and Generalization of Push-Sum Based Decentralized Optimization over Directed Graphs

Yifei Liang, Yan Sun, Xiaochun Cao, Li Shen

PDF

Open Access

TL;DR

This paper develops a stability framework for Push-Sum-based decentralized optimization over directed graphs, analyzing how topology and imbalance affect convergence and generalization in both convex and non-convex settings.

Contribution

It introduces a unified stability analysis capturing the effects of directed topology, imbalance, and mixing speed on decentralized learning performance.

Findings

01

Establishes finite-iteration stability and generalization bounds for Push-Sum algorithms.

02

Characterizes the impact of imbalance and spectral gap on convergence rates.

03

Provides conditions when Push-Sum correction is necessary versus standard decentralized SGD.

Abstract

Push-Sum-based decentralized learning enables optimization over directed communication networks, where information exchange may be asymmetric. While convergence properties of such methods are well understood, their finite-iteration stability and generalization behavior remain unclear due to structural bias induced by column-stochastic mixing and asymmetric error propagation. In this work, we develop a unified uniform-stability framework for the Stochastic Gradient Push (SGP) algorithm that captures the effect of directed topology. A key technical ingredient is an imbalance-aware consistency bound for Push-Sum, which controls consensus deviation through two quantities: the stationary distribution imbalance parameter $δ$ and the spectral gap $(1 - λ)$ governing mixing speed. This decomposition enables us to disentangle statistical effects from topology-induced bias. We establish…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDistributed Control Multi-Agent Systems · Stochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data