Lightweight Projective Derivative Codes for Compressed Asynchronous Gradient Descent
Pedro Soto, Ilia Ilmer, Haibin Guan, Jun Li

TL;DR
This paper introduces a novel lossy compression coding scheme for distributed gradient descent that encodes derivatives directly, enabling asynchronous updates and robustness in machine learning training.
Contribution
It proposes a new coding algorithm that encodes partial derivatives with lossy compression, allowing asynchronous updates and robustness in gradient descent.
Findings
Enables asynchronous gradient updates through low-weight codes.
Maximizes information in codewords while minimizing inter-codeword information.
Applicable to general machine learning frameworks like deep neural networks.
Abstract
Coded distributed computation has become common practice for performing gradient descent on large datasets to mitigate stragglers and other faults. This paper proposes a novel algorithm that encodes the partial derivatives themselves and furthermore optimizes the codes by performing lossy compression on the derivative codewords by maximizing the information contained in the codewords while minimizing the information between the codewords. The utility of this application of coding theory is a geometrical consequence of the observed fact in optimization research that noise is tolerable, sometimes even helpful, in gradient descent based learning algorithms since it helps avoid overfitting and local minima. This stands in contrast with much current conventional work on distributed coded computation which focuses on recovering all of the data from the workers. A second further contribution…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Privacy-Preserving Technologies in Data
