CEDAS: A Compressed Decentralized Stochastic Gradient Method with   Improved Convergence

Kun Huang; Shi Pu

arXiv:2301.05872·math.OC·October 1, 2024

CEDAS: A Compressed Decentralized Stochastic Gradient Method with Improved Convergence

Kun Huang, Shi Pu

PDF

Open Access

TL;DR

This paper introduces CEDAS, a compressed decentralized stochastic gradient method that achieves near-centralized convergence rates with minimal transient time, improving efficiency in communication-restricted distributed optimization.

Contribution

CEDAS is the first method to attain the convergence rate of centralized SGD with the shortest known transient time in decentralized settings under compression.

Findings

01

CEDAS achieves convergence rates comparable to centralized SGD.

02

CEDAS has the shortest transient time among decentralized methods.

03

Numerical experiments confirm the effectiveness of CEDAS.

Abstract

In this paper, we consider solving the distributed optimization problem over a multi-agent network under the communication restricted setting. We study a compressed decentralized stochastic gradient method, termed ``compressed exact diffusion with adaptive stepsizes (CEDAS)", and show the method asymptotically achieves comparable convergence rate as centralized { stochastic gradient descent (SGD)} for both smooth strongly convex objective functions and smooth nonconvex objective functions under unbiased compression operators. In particular, to our knowledge, CEDAS enjoys so far the shortest transient time (with respect to the graph specifics) for achieving the convergence rate of centralized SGD, which behaves as $O (n C^{3} / (1 - λ_{2})^{2})$ under smooth strongly convex objective functions, and $O (n^{3} C^{6} / (1 - λ_{2})^{4})$ under smooth nonconvex objective…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Neuroimaging Techniques and Applications

MethodsStochastic Gradient Descent · Diffusion