Deep Learning with Low Precision by Half-wave Gaussian Quantization
Zhaowei Cai, Xiaodong He, Jian Sun, Nuno Vasconcelos

TL;DR
This paper introduces a low-precision neural network quantization method using Half-wave Gaussian Quantization (HWGQ), which closely approximates full-precision networks and is efficiently implementable, improving performance over previous low-precision approaches.
Contribution
The paper proposes HWGQ for forward activation approximation and investigates backward approximators, resulting in HWGQ-Net that significantly improves low-precision neural network performance.
Findings
HWGQ-Net achieves performance close to full-precision models.
Efficient implementation leveraging activation statistics and batch normalization.
Outperforms previous low-precision networks with binary weights and 2-bit activations.
Abstract
The problem of quantizing the activations of a deep neural network is considered. An examination of the popular binary quantization approach shows that this consists of approximating a classical non-linearity, the hyperbolic tangent, by two functions: a piecewise constant sign function, which is used in feedforward network computations, and a piecewise linear hard tanh function, used in the backpropagation step during network learning. The problem of approximating the ReLU non-linearity, widely used in the recent deep learning literature, is then considered. An half-wave Gaussian quantizer (HWGQ) is proposed for forward approximation and shown to have efficient implementation, by exploiting the statistics of of network activations and batch normalization operations commonly used in the literature. To overcome the problem of gradient mismatch, due to the use of different forward and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Remote-Sensing Image Classification · Advanced Neural Network Applications
MethodsGlobal Average Pooling · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Kaiming Initialization · Residual Connection · Convolution · Residual Block · Average Pooling · Local Response Normalization
