Bitwidth-Adaptive Quantization-Aware Neural Network Training: A Meta-Learning Approach
Jiseok Youn, Jaehun Song, Hyung-Sin Kim, Saewoong Bahk

TL;DR
This paper introduces MEBQAT, a meta-learning based method for neural network quantization that adapts to various bitwidths and target classes, enabling efficient deployment with minimal accuracy loss.
Contribution
It proposes a novel meta-learning approach for bitwidth-adaptive quantization-aware training, supporting both adaptive quantization and few-shot adaptation to unseen classes.
Findings
MEBQAT outperforms existing QAT methods in robustness and accuracy.
Supports joint adaptation of bitwidths and target classes.
Effective across multiple quantization schemes.
Abstract
Deep neural network quantization with adaptive bitwidths has gained increasing attention due to the ease of model deployment on various platforms with different resource budgets. In this paper, we propose a meta-learning approach to achieve this goal. Specifically, we propose MEBQAT, a simple yet effective way of bitwidth-adaptive quantization aware training (QAT) where meta-learning is effectively combined with QAT by redefining meta-learning tasks to incorporate bitwidths. After being deployed on a platform, MEBQAT allows the (meta-)trained model to be quantized to any candidate bitwidth then helps to conduct inference without much accuracy drop from quantization. Moreover, with a few-shot learning scenario, MEBQAT can also adapt a model to any bitwidth as well as any unseen target classes by adding conventional optimization or metric-based meta-learning. We design variants of MEBQAT…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Cancer-related molecular mechanisms research · Advanced Neural Network Applications
MethodsAttentive Walk-Aggregating Graph Neural Network
