Straggler-Resilient Decentralized Learning via Adaptive Asynchronous   Updates

Guojun Xiong; Gang Yan; Shiqiang Wang; Jian Li

arXiv:2306.06559·cs.LG·July 10, 2024·2 cites

Straggler-Resilient Decentralized Learning via Adaptive Asynchronous Updates

Guojun Xiong, Gang Yan, Shiqiang Wang, Jian Li

PDF

Open Access

TL;DR

This paper introduces DSGD-AAU, a decentralized learning algorithm with adaptive asynchronous updates that mitigates straggler effects, achieves linear convergence speedup, and is validated through extensive experiments.

Contribution

The paper proposes a novel decentralized algorithm with adaptive asynchronous updates that reduces straggler impact and improves convergence speed.

Findings

01

Achieves linear speedup in convergence

02

Effectively mitigates straggler effects

03

Validated through extensive experiments

Abstract

With the increasing demand for large-scale training of machine learning models, fully decentralized optimization methods have recently been advocated as alternatives to the popular parameter server framework. In this paradigm, each worker maintains a local estimate of the optimal parameter vector, and iteratively updates it by waiting and averaging all estimates obtained from its neighbors, and then corrects it on the basis of its local dataset. However, the synchronization phase is sensitive to stragglers. An efficient way to mitigate this effect is to consider asynchronous updates, where each worker computes stochastic gradients and communicates with other workers at its own pace. Unfortunately, fully asynchronous updates suffer from staleness of stragglers' parameters. To address these limitations, we propose a fully decentralized algorithm DSGD-AAU with adaptive asynchronous updates…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Age of Information Optimization · Privacy-Preserving Technologies in Data