Zero-shot Adversarial Quantization
Yuang Liu, Wei Zhang, Jun Wang

TL;DR
This paper introduces ZAQ, a zero-shot adversarial quantization framework that enables effective model quantization without training data, using a novel discrepancy modeling and adversarial learning to synthesize data for fine-tuning.
Contribution
The paper proposes a new zero-shot quantization method with a two-level discrepancy model and adversarial training, improving performance without access to training data.
Findings
ZAQ outperforms existing zero-shot quantization methods.
The framework effectively synthesizes diverse data for model fine-tuning.
Extensive experiments validate the superiority of ZAQ on vision tasks.
Abstract
Model quantization is a promising approach to compress deep neural networks and accelerate inference, making it possible to be deployed on mobile and edge devices. To retain the high performance of full-precision models, most existing quantization methods focus on fine-tuning quantized model by assuming training datasets are accessible. However, this assumption sometimes is not satisfied in real situations due to data privacy and security issues, thereby making these quantization methods not applicable. To achieve zero-short model quantization without accessing training data, a tiny number of quantization methods adopt either post-training quantization or batch normalization statistics-guided data generation for fine-tuning. However, both of them inevitably suffer from low performance, since the former is a little too empirical and lacks training support for ultra-low precision…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning
MethodsBatch Normalization
