Proving the Limited Scalability of Centralized Distributed Optimization via a New Lower Bound Construction

Alexander Tyurin

arXiv:2506.23836·math.OC·March 31, 2026

Proving the Limited Scalability of Centralized Distributed Optimization via a New Lower Bound Construction

Alexander Tyurin

PDF

1 Video

TL;DR

This paper establishes fundamental limitations on the scalability of centralized distributed optimization in federated learning, showing that communication and variance constraints prevent significant improvements beyond poly-logarithmic speedup with increasing workers.

Contribution

The authors introduce a new lower bound framework and a worst-case function to prove inherent scalability limits in distributed optimization with unbiased sparsification.

Findings

01

Communication from server to workers limits scalability improvements.

02

Variance reduction techniques cannot significantly outperform poly-logarithmic scaling.

03

New lower bound framework and concentration bounds underpin the theoretical results.

Abstract

We consider centralized distributed optimization in the classical federated learning setup, where $n$ workers jointly find an $ε$ -stationary point of an $L$ -smooth, $d$ -dimensional nonconvex function $f$ , having access only to unbiased stochastic gradients with variance $σ^{2}$ . Each worker requires at most $h$ seconds to compute a stochastic gradient, and the communication times from the server to the workers and from the workers to the server are $τ_{s}$ and $τ_{w}$ seconds per coordinate, respectively. One of the main motivations for distributed optimization is to achieve scalability with respect to $n$ . For instance, it is well known that the distributed version of SGD has a variance-dependent runtime term $\frac{h σ ^{2} L Δ}{n ε ^{2}},$ which improves with the number of workers $n,$ where $Δ = f (x^{0}) - f^{*},$ and $x^{0} \in R^{d}$ is the starting…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Proving the Limited Scalability of Centralized Distributed Optimization via a New Lower Bound Construction· slideslive