AET-SGD: Asynchronous Event-triggered Stochastic Gradient Descent
Nhuong Nguyen, Song Han

TL;DR
AET-SGD is an asynchronous event-triggered stochastic gradient descent framework that significantly reduces communication costs in distributed learning while maintaining good convergence and robustness against delays.
Contribution
It introduces a novel linear increasing sample size event-triggered threshold for asynchronous SGD, effectively reducing communication and mitigating delay impacts.
Findings
Achieves 44x to 120x reduction in communication cost.
Maintains convergence performance despite large delays.
Validated on multiple datasets including MNIST and CIFAR10.
Abstract
Communication cost is the main bottleneck for the design of effective distributed learning algorithms. Recently, event-triggered techniques have been proposed to reduce the exchanged information among compute nodes and thus alleviate the communication cost. However, most existing event-triggered approaches only consider heuristic event-triggered thresholds. They also ignore the impact of computation and network delay, which play an important role on the training performance. In this paper, we propose an Asynchronous Event-triggered Stochastic Gradient Descent (SGD) framework, called AET-SGD, to i) reduce the communication cost among the compute nodes, and ii) mitigate the impact of the delay. Compared with baseline event-triggered methods, AET-SGD employs a linear increasing sample size event-triggered threshold, and can significantly reduce the communication cost while keeping good…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Machine Learning and ELM · Privacy-Preserving Technologies in Data
