Row-stochastic matrices can provably outperform doubly stochastic matrices in decentralized learning
Bing Liu, Boao Kong, Limin Lu, Kun Yuan, and Chengcheng Zhao

TL;DR
This paper demonstrates that row-stochastic matrices can outperform doubly stochastic matrices in decentralized learning by analyzing convergence in a weighted Hilbert space, revealing new geometric insights.
Contribution
The authors develop a weighted Hilbert-space framework to compare stochastic matrices, showing conditions where row-stochastic matrices converge faster despite smaller spectral gaps.
Findings
Row-stochastic matrices can have tighter convergence guarantees than doubly stochastic matrices.
The weighted Hilbert-space analysis reveals additional penalty terms affecting convergence.
Topology conditions are derived to ensure row-stochastic matrices outperform doubly stochastic matrices.
Abstract
Decentralized learning often involves a weighted global loss with heterogeneous node weights . We revisit two natural strategies for incorporating these weights: (i) embedding them into the local losses to retain a uniform weight (and thus a doubly stochastic matrix), and (ii) keeping the original losses while employing a -induced row-stochastic matrix. Although prior work shows that both strategies yield the same expected descent direction for the global loss, it remains unclear whether the Euclidean-space guarantees are tight and what fundamentally differentiates their behaviors. To clarify this, we develop a weighted Hilbert-space framework and obtain convergence rates that are strictly tighter than those from Euclidean analysis. In this geometry, the row-stochastic matrix becomes self-adjoint whereas the doubly stochastic one does not,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
