Optimization Trade-offs in Asynchronous Federated Learning: A Stochastic Networks Approach
Abdelkrim Alahyane (LAAS-SARA), C\'eline Comte (CNRS, LAAS-SARA), Matthieu Jonckheere (CNRS, LAAS-SARA)

TL;DR
This paper develops a stochastic queueing-network model for asynchronous federated learning, providing closed-form expressions for throughput, convergence, and energy trade-offs, and proposes optimization strategies to improve efficiency.
Contribution
It introduces a novel analytical framework that captures queueing dynamics in asynchronous federated learning, enabling explicit trade-off analysis and optimization.
Findings
Closed-form expression for update throughput.
Quantified trade-offs between staleness, convergence speed, and energy.
Experimental results show significant reductions in convergence time and energy consumption.
Abstract
Synchronous federated learning scales poorly due to the straggler effect. Asynchronous algorithms increase the update throughput by processing updates upon arrival, but they introduce two fundamental challenges: gradient staleness, which degrades convergence, and bias toward faster clients under heterogeneous data distributions. Although algorithms such as AsyncSGD and Generalized AsyncSGD mitigate this bias via client-side task queues, most existing analyses neglect the underlying queueing dynamics and lack closed-form characterizations of the update throughput and gradient staleness. To close this gap, we develop a stochastic queueing-network framework for Generalized AsyncSGD that jointly models random computation times at the clients and the central server, as well as random uplink and downlink communication delays. Leveraging product-form network theory, we derive a closed-form…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
