Adjacent Leader Decentralized Stochastic Gradient Descent
Haoze He, Jing Wang, Anna Choromanska

TL;DR
This paper introduces AL-DSGD, a novel decentralized gradient descent method that enhances model performance and convergence speed in communication-limited environments by adaptive weighting and dynamic communication graphs.
Contribution
The paper proposes AL-DSGD, a new decentralized optimization algorithm that improves convergence and performance through adaptive neighbor influence and dynamic graph communication.
Findings
AL-DSGD accelerates convergence compared to state-of-the-art methods.
AL-DSGD improves test performance in communication-constrained settings.
Theoretical convergence of AL-DSGD is established.
Abstract
This work focuses on the decentralized deep learning optimization framework. We propose Adjacent Leader Decentralized Gradient Descent (AL-DSGD), for improving final model performance, accelerating convergence, and reducing the communication overhead of decentralized deep learning optimizers. AL-DSGD relies on two main ideas. Firstly, to increase the influence of the strongest learners on the learning system it assigns weights to different neighbor workers according to both their performance and the degree when averaging among them, and it applies a corrective force on the workers dictated by both the currently best-performing neighbor and the neighbor with the maximal degree. Secondly, to alleviate the problem of the deterioration of the convergence speed and performance of the nodes with lower degrees, AL-DSGD relies on dynamic communication graphs, which effectively allows the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed Control Multi-Agent Systems · Molecular Communication and Nanonetworks
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Lib
