Fast Distributed Gradient Methods
Dusan Jakovetic, Joao Xavier, Jose M. F. Moura

TL;DR
This paper introduces two fast distributed gradient algorithms for convex optimization over networks, achieving improved convergence rates with explicit dependence on network parameters, and demonstrates their effectiveness through simulations.
Contribution
The paper proposes two novel distributed Nesterov gradient algorithms with enhanced convergence rates and explicit dependence on network size and topology, advancing distributed optimization methods.
Findings
Achieves convergence rates of O(log K / K) and O(1 / K^2)
Provides explicit constants depending on network size and topology
Demonstrates effectiveness through simulation examples
Abstract
We study distributed optimization problems when nodes minimize the sum of their individual costs subject to a common vector variable. The costs are convex, have Lipschitz continuous gradient (with constant ), and bounded gradient. We propose two fast distributed gradient algorithms based on the centralized Nesterov gradient algorithm and establish their convergence rates in terms of the per-node communications and the per-node gradient evaluations . Our first method, Distributed Nesterov Gradient, achieves rates and . Our second method, Distributed Nesterov gradient with Consensus iterations, assumes at all nodes knowledge of and -- the second largest singular value of the doubly stochastic weight matrix . It achieves rates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
