Row-stochastic matrices can provably outperform doubly stochastic matrices in decentralized learning

Bing Liu; Boao Kong; Limin Lu; Kun Yuan; and Chengcheng Zhao

arXiv:2511.19513·cs.LG·May 7, 2026

Row-stochastic matrices can provably outperform doubly stochastic matrices in decentralized learning

Bing Liu, Boao Kong, Limin Lu, Kun Yuan, and Chengcheng Zhao

PDF

TL;DR

This paper demonstrates that row-stochastic matrices can outperform doubly stochastic matrices in decentralized learning by analyzing convergence in a weighted Hilbert space, revealing new geometric insights.

Contribution

The authors develop a weighted Hilbert-space framework to compare stochastic matrices, showing conditions where row-stochastic matrices converge faster despite smaller spectral gaps.

Findings

01

Row-stochastic matrices can have tighter convergence guarantees than doubly stochastic matrices.

02

The weighted Hilbert-space analysis reveals additional penalty terms affecting convergence.

03

Topology conditions are derived to ensure row-stochastic matrices outperform doubly stochastic matrices.

Abstract

Decentralized learning often involves a weighted global loss with heterogeneous node weights $λ$ . We revisit two natural strategies for incorporating these weights: (i) embedding them into the local losses to retain a uniform weight (and thus a doubly stochastic matrix), and (ii) keeping the original losses while employing a $λ$ -induced row-stochastic matrix. Although prior work shows that both strategies yield the same expected descent direction for the global loss, it remains unclear whether the Euclidean-space guarantees are tight and what fundamentally differentiates their behaviors. To clarify this, we develop a weighted Hilbert-space framework $L^{2} (λ; R^{d})$ and obtain convergence rates that are strictly tighter than those from Euclidean analysis. In this geometry, the row-stochastic matrix becomes self-adjoint whereas the doubly stochastic one does not,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.