Semi-Relaxed Quantization with DropBits: Training Low-Bit Neural Networks via Bit-wise Regularization
Jung Hyun Lee, Jihun Yun, Sung Ju Hwang, Eunho Yang

TL;DR
This paper introduces Semi-Relaxed Quantization with DropBits, a novel approach for training low-bit neural networks that reduces bias and variance in quantization, enabling more efficient deployment on resource-limited devices.
Contribution
The paper proposes Semi-Relaxed Quantization with DropBits, a new method combining multi-class straight-through estimator and bit-dropping regularization to improve low-bit neural network training.
Findings
Semi-Relaxed Quantization outperforms Gumbel-Softmax based methods.
DropBits effectively reduces bias in quantization.
Learning heterogeneous quantization levels improves performance.
Abstract
Network quantization, which aims to reduce the bit-lengths of the network weights and activations, has emerged as one of the key ingredients to reduce the size of neural networks for their deployments to resource-limited devices. In order to overcome the nature of transforming continuous activations and weights to discrete ones, recent study called Relaxed Quantization (RQ) [Louizos et al. 2019] successfully employ the popular Gumbel-Softmax that allows this transformation with efficient gradient-based optimization. However, RQ with this Gumbel-Softmax relaxation still suffers from bias-variance trade-off depending on the temperature parameter of Gumbel-Softmax. To resolve the issue, we propose a novel method, Semi-Relaxed Quantization (SRQ) that uses multi-class straight-through estimator to effectively reduce the bias and variance, along with a new regularization technique, DropBits…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
MethodsDropout
