AET-SGD: Asynchronous Event-triggered Stochastic Gradient Descent

Nhuong Nguyen; Song Han

arXiv:2112.13935·cs.LG·December 30, 2021

AET-SGD: Asynchronous Event-triggered Stochastic Gradient Descent

Nhuong Nguyen, Song Han

PDF

Open Access 1 Repo

TL;DR

AET-SGD is an asynchronous event-triggered stochastic gradient descent framework that significantly reduces communication costs in distributed learning while maintaining good convergence and robustness against delays.

Contribution

It introduces a novel linear increasing sample size event-triggered threshold for asynchronous SGD, effectively reducing communication and mitigating delay impacts.

Findings

01

Achieves 44x to 120x reduction in communication cost.

02

Maintains convergence performance despite large delays.

03

Validated on multiple datasets including MNIST and CIFAR10.

Abstract

Communication cost is the main bottleneck for the design of effective distributed learning algorithms. Recently, event-triggered techniques have been proposed to reduce the exchanged information among compute nodes and thus alleviate the communication cost. However, most existing event-triggered approaches only consider heuristic event-triggered thresholds. They also ignore the impact of computation and network delay, which play an important role on the training performance. In this paper, we propose an Asynchronous Event-triggered Stochastic Gradient Descent (SGD) framework, called AET-SGD, to i) reduce the communication cost among the compute nodes, and ii) mitigate the impact of the delay. Compared with baseline event-triggered methods, AET-SGD employs a linear increasing sample size event-triggered threshold, and can significantly reduce the communication cost while keeping good…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mlconference/aet-sgd
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Machine Learning and ELM · Privacy-Preserving Technologies in Data