Dual Averaging for Distributed Optimization: Convergence Analysis and Network Scaling
John Duchi, Alekh Agarwal, Martin Wainwright

TL;DR
This paper introduces distributed dual averaging algorithms for decentralized convex optimization, providing convergence bounds that depend on network topology and size, with both theoretical analysis and simulations confirming the results.
Contribution
It develops and analyzes dual averaging-based distributed algorithms, offering sharp convergence bounds that explicitly relate to network spectral properties and scale with network size.
Findings
Convergence rate scales inversely with the spectral gap of the network.
Theoretical bounds are confirmed by simulations across various network topologies.
The approach applies to both deterministic and stochastic optimization scenarios.
Abstract
The goal of decentralized optimization over a network is to optimize a global objective formed by a sum of local (possibly nonsmooth) convex functions using only local computation and communication. It arises in various application domains, including distributed tracking and localization, multi-agent co-ordination, estimation in sensor networks, and large-scale optimization in machine learning. We develop and analyze distributed algorithms based on dual averaging of subgradients, and we provide sharp bounds on their convergence rates as a function of the network size and topology. Our method of analysis allows for a clear separation between the convergence of the optimization algorithm itself and the effects of communication constraints arising from the network structure. In particular, we show that the number of iterations required by our algorithm scales inversely in the spectral gap…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
