Genie: Show Me the Data for Quantization

Yongkweon Jeon; Chungman Lee; Ho-young Kim

arXiv:2212.04780·cs.LG·August 9, 2023

Genie: Show Me the Data for Quantization

Yongkweon Jeon, Chungman Lee, Ho-young Kim

PDF

Open Access

TL;DR

Genie introduces a novel post-training zero-shot quantization framework that synthesizes data from batch normalization parameters to produce high-quality quantized neural networks rapidly without real datasets.

Contribution

This paper presents a new post-training zero-shot quantization method and a data synthesis framework called Genie, bridging the gap between zero-shot and few-shot quantization with improved performance.

Findings

01

Achieves high-quality quantization within hours without real data

02

Generates synthetic data that enables robust model quantization

03

Outperforms existing zero-shot quantization methods

Abstract

Zero-shot quantization is a promising approach for developing lightweight deep neural networks when data is inaccessible owing to various reasons, including cost and issues related to privacy. By exploiting the learned parameters ( $μ$ and $σ$ ) of batch normalization layers in an FP32-pre-trained model, zero-shot quantization schemes focus on generating synthetic data. Subsequently, they distill knowledge from the pre-trained model (teacher) to the quantized model (student) such that the quantized model can be optimized with the synthetic dataset. However, thus far, zero-shot quantization has primarily been discussed in the context of quantization-aware training methods, which require task-specific losses and long-term optimization as much as retraining. We thus introduce a post-training quantization scheme for zero-shot quantization that produces high-quality quantized networks…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques