EQ-Net: Elastic Quantization Neural Networks
Ke Xu, Lei Han, Ye Tian, Shangshang Yang, Xingyi Zhang

TL;DR
EQ-Net introduces a flexible, one-shot neural network quantization approach that supports various hardware-specific quantization forms through an elastic quantization space, regularization techniques, and efficient search methods.
Contribution
The paper proposes a novel elastic quantization space and training framework for flexible, hardware-adaptive neural network quantization, along with new regularization losses and an efficient search algorithm.
Findings
EQ-Net achieves comparable or better performance than static and state-of-the-art methods.
The elastic quantization space effectively adapts to different hardware requirements.
The proposed search method efficiently finds optimal mixed-precision quantization configurations.
Abstract
Current model quantization methods have shown their promising capability in reducing storage space and computation complexity. However, due to the diversity of quantization forms supported by different hardware, one limitation of existing solutions is that usually require repeated optimization for different scenarios. How to construct a model with flexible quantization forms has been less studied. In this paper, we explore a one-shot network quantization regime, named Elastic Quantization Neural Networks (EQ-Net), which aims to train a robust weight-sharing quantization supernet. First of all, we propose an elastic quantization space (including elastic bit-width, granularity, and symmetry) to adapt to various mainstream quantitative forms. Secondly, we propose the Weight Distribution Regularization Loss (WDR-Loss) and Group Progressive Guidance Loss (GPG-Loss) to bridge the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Neural Networks and Applications · Brain Tumor Detection and Classification
