Fast Distributed Gradient Methods

Dusan Jakovetic; Joao Xavier; Jose M. F. Moura

arXiv:1112.2972·cs.IT·April 15, 2014

Fast Distributed Gradient Methods

Dusan Jakovetic, Joao Xavier, Jose M. F. Moura

PDF

TL;DR

This paper introduces two fast distributed gradient algorithms for convex optimization over networks, achieving improved convergence rates with explicit dependence on network parameters, and demonstrates their effectiveness through simulations.

Contribution

The paper proposes two novel distributed Nesterov gradient algorithms with enhanced convergence rates and explicit dependence on network size and topology, advancing distributed optimization methods.

Findings

01

Achieves convergence rates of O(log K / K) and O(1 / K^2)

02

Provides explicit constants depending on network size and topology

03

Demonstrates effectiveness through simulation examples

Abstract

We study distributed optimization problems when $N$ nodes minimize the sum of their individual costs subject to a common vector variable. The costs are convex, have Lipschitz continuous gradient (with constant $L$ ), and bounded gradient. We propose two fast distributed gradient algorithms based on the centralized Nesterov gradient algorithm and establish their convergence rates in terms of the per-node communications $K$ and the per-node gradient evaluations $k$ . Our first method, Distributed Nesterov Gradient, achieves rates $O (lo g K / K)$ and $O (lo g k / k)$ . Our second method, Distributed Nesterov gradient with Consensus iterations, assumes at all nodes knowledge of $L$ and $μ (W)$ -- the second largest singular value of the $N \times N$ doubly stochastic weight matrix $W$ . It achieves rates…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.