An Inter-Layer Weight Prediction and Quantization for Deep Neural Networks based on a Smoothly Varying Weight Hypothesis
Kang-Ho Lee, JoonHyun Jeong, and Sung-Ho Bae

TL;DR
This paper introduces a novel neural network compression technique leveraging inter-layer weight prediction, the Smoothly Varying Weight Hypothesis, and an inter-layer loss, significantly improving compression rates while maintaining accuracy.
Contribution
It proposes a new compression method combining inter-layer weight prediction, a hypothesis on weight smoothness across layers, and a specialized loss to enhance neural network compression.
Findings
Achieves higher compression rates at the same accuracy level.
Effectively utilizes weight residuals following Laplace distributions.
Enhances storage efficiency by eliminating non-texture bits.
Abstract
Due to a resource-constrained environment, network compression has become an important part of deep neural networks research. In this paper, we propose a new compression method, \textit{Inter-Layer Weight Prediction} (ILWP) and quantization method which quantize the predicted residuals between the weights in all convolution layers based on an inter-frame prediction method in conventional video coding schemes. Furthermore, we found a phenomenon \textit{Smoothly Varying Weight Hypothesis} (SVWH) which is that the weights in adjacent convolution layers share strong similarity in shapes and values, i.e., the weights tend to vary smoothly along with the layers. Based on SVWH, we propose a second ILWP and quantization method which quantize the predicted residuals between the weights in adjacent convolution layers. Since the predicted weight residuals tend to follow Laplace distributions with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image Processing Techniques · Image Enhancement Techniques
MethodsConvolution
