DNN gradient lossless compression: Can GenNorm be the answer?
Zhong-Jing Chen, Eduin E. Hernandez, Yu-Chih Huang, Stefano Rini

TL;DR
This paper proposes modeling DNN gradient distributions with generalized normal (GenNorm) distribution to improve lossless compression efficiency in distributed training, especially in federated learning scenarios.
Contribution
It introduces the GenNorm distribution as a more accurate model for DNN gradients, leading to better lossless compression performance compared to previous models.
Findings
GenNorm models provide a better fit for gradient tail distributions.
Applying classical coding algorithms to GenNorm-modeled gradients improves compression.
The approach offers low complexity and practical benefits in distributed DNN training.
Abstract
In this paper, the problem of optimal gradient lossless compression in Deep Neural Network (DNN) training is considered. Gradient compression is relevant in many distributed DNN training scenarios, including the recently popular federated learning (FL) scenario in which each remote users are connected to the parameter server (PS) through a noiseless but rate limited channel. In distributed DNN training, if the underlying gradient distribution is available, classical lossless compression approaches can be used to reduce the number of bits required for communicating the gradient entries. Mean field analysis has suggested that gradient updates can be considered as independent random variables, while Laplace approximation can be used to argue that gradient has a distribution approximating the normal (Norm) distribution in some regimes. In this paper we argue that, for some networks of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI · Advanced Neural Network Applications
