Distance-aware Quantization
Dohyung kim, Junghyup Lee, Bumsub Ham

TL;DR
This paper introduces a novel distance-aware quantizer (DAQ) for neural network quantization, effectively addressing gradient mismatch and quantizer gap problems, leading to superior performance across benchmarks.
Contribution
The proposed DAQ combines a distance-aware soft rounding and an adaptive temperature controller to improve quantization training.
Findings
Outperforms state-of-the-art methods on standard benchmarks.
Effectively reduces gradient mismatch and quantizer gap issues.
Achieves superior accuracy across various bit-widths.
Abstract
We address the problem of network quantization, that is, reducing bit-widths of weights and/or activations to lighten network architectures. Quantization methods use a rounding function to map full-precision values to the nearest quantized ones, but this operation is not differentiable. There are mainly two approaches to training quantized networks with gradient-based optimizers. First, a straight-through estimator (STE) replaces the zero derivative of the rounding with that of an identity function, which causes a gradient mismatch problem. Second, soft quantizers approximate the rounding with continuous functions at training time, and exploit the rounding for quantization at test time. This alleviates the gradient mismatch, but causes a quantizer gap problem. We alleviate both problems in a unified framework. To this end, we introduce a novel quantizer, dubbed a distance-aware…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image Enhancement Techniques · Image and Signal Denoising Methods
