Long-Range Zero-Shot Generative Deep Network Quantization
Yan Luo, Yangcheng Gao, Zhao Zhang, Haijun Zhang, Mingliang Xu, Meng, Wang

TL;DR
This paper introduces LRQ, a novel zero-shot quantization method using a long-range generator with attention and adversarial margin techniques, significantly improving the quality of synthetic data and quantization performance without real data.
Contribution
The paper proposes a long-range generator with attention and an adversarial margin module, enhancing zero-shot quantization by producing more global and diverse synthetic data.
Findings
LRQ outperforms existing zero-shot quantization methods.
Long-range attention improves synthetic data diversity.
Decoupled knowledge distillation enhances transfer from full-precision models.
Abstract
Quantization approximates a deep network model with floating-point numbers by the one with low bit width numbers, in order to accelerate inference and reduce computation. Quantizing a model without access to the original data, zero-shot quantization can be accomplished by fitting the real data distribution by data synthesis. However, zero-shot quantization achieves inferior performance compared to the post-training quantization with real data. We find it is because: 1) a normal generator is hard to obtain high diversity of synthetic data, since it lacks long-range information to allocate attention to global features; 2) the synthetic images aim to simulate the statistics of real data, which leads to weak intra-class heterogeneity and limited feature richness. To overcome these problems, we propose a novel deep network quantizer, dubbed Long-Range Zero-Shot Generative Deep Network…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI
MethodsConvolution
