Genie: Show Me the Data for Quantization
Yongkweon Jeon, Chungman Lee, Ho-young Kim

TL;DR
Genie introduces a novel post-training zero-shot quantization framework that synthesizes data from batch normalization parameters to produce high-quality quantized neural networks rapidly without real datasets.
Contribution
This paper presents a new post-training zero-shot quantization method and a data synthesis framework called Genie, bridging the gap between zero-shot and few-shot quantization with improved performance.
Findings
Achieves high-quality quantization within hours without real data
Generates synthetic data that enables robust model quantization
Outperforms existing zero-shot quantization methods
Abstract
Zero-shot quantization is a promising approach for developing lightweight deep neural networks when data is inaccessible owing to various reasons, including cost and issues related to privacy. By exploiting the learned parameters ( and ) of batch normalization layers in an FP32-pre-trained model, zero-shot quantization schemes focus on generating synthetic data. Subsequently, they distill knowledge from the pre-trained model (teacher) to the quantized model (student) such that the quantized model can be optimized with the synthetic dataset. However, thus far, zero-shot quantization has primarily been discussed in the context of quantization-aware training methods, which require task-specific losses and long-term optimization as much as retraining. We thus introduce a post-training quantization scheme for zero-shot quantization that produces high-quality quantized networks…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
