Training Multi-bit Quantized and Binarized Networks with A Learnable Symmetric Quantizer
Phuoc Pham, Jacob Abraham, Jaeyong Chung

TL;DR
This paper introduces UniQ, a unified quantization framework that effectively trains both multi-bit and binary neural networks using a learnable symmetric quantizer, improving accuracy and simplifying the training process.
Contribution
The paper proposes a novel unified quantization framework with a learnable symmetric quantizer, enabling effective training of both multi-bit and binary networks without architecture modifications.
Findings
UniQ outperforms existing methods on ImageNet for multi-bit quantization.
Achieves state-of-the-art accuracy in multi-bit quantization tasks.
Binarized networks trained with UniQ reach accuracy comparable to specialized methods.
Abstract
Quantizing weights and activations of deep neural networks is essential for deploying them in resource-constrained devices, or cloud platforms for at-scale services. While binarization is a special case of quantization, this extreme case often leads to several training difficulties, and necessitates specialized models and training methods. As a result, recent quantization methods do not provide binarization, thus losing the most resource-efficient option, and quantized and binarized networks have been distinct research areas. We examine binarization difficulties in a quantization framework and find that all we need to enable the binary training are a symmetric quantizer, good initialization, and careful hyperparameter selection. These techniques also lead to substantial improvements in multi-bit quantization. We demonstrate our unified quantization framework, denoted as UniQ, on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
MethodsPointwise Convolution · Average Pooling · Batch Normalization · Depthwise Convolution · Depthwise Separable Convolution · Inverted Residual Block · 1x1 Convolution · Convolution · Tether Customer Service Number +1-833-534-1729
