Arbitrary Bit-width Network: A Joint Layer-Wise Quantization and Adaptive Inference Approach
Chen Tang, Haoyu Zhai, Kai Ouyang, Zhi Wang, Yifei Zhu, Wenwu Zhu

TL;DR
This paper introduces the Arbitrary Bit-width Network (ABN), a flexible deep learning model that dynamically adjusts layer-wise quantization bit-widths during inference to improve efficiency and accuracy on image classification tasks.
Contribution
The paper presents a novel super-network with shared weights allowing runtime layer-wise bit-width adjustments, enabling data-dependent adaptive inference without retraining.
Findings
Achieves 1.1% top-1 accuracy improvement on ImageNet.
Saves 36.2% BitOps during inference.
Maintains accuracy with diverse bit-width configurations.
Abstract
Conventional model quantization methods use a fixed quantization scheme to different data samples, which ignores the inherent "recognition difficulty" differences between various samples. We propose to feed different data samples with varying quantization schemes to achieve a data-dependent dynamic inference, at a fine-grained layer level. However, enabling this adaptive inference with changeable layer-wise quantization schemes is challenging because the combination of bit-widths and layers is growing exponentially, making it extremely difficult to train a single model in such a vast searching space and use it in practice. To solve this problem, we present the Arbitrary Bit-width Network (ABN), where the bit-widths of a single deep network can change at runtime for different data samples, with a layer-wise granularity. Specifically, first we build a weight-shared layer-wise quantizable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
