Quantization of Generative Adversarial Networks for Efficient Inference: a Methodological Study
Pavel Andreev (1, 2, 3), Alexander Fritzler (1, 2, 4), Dmitry Vetrov, (1, 3, 5) ((1) Higher School of Economics, (2) Skolkovo Institute of Science, and Technology, (3) Samsung AI Center Moscow, (4) Yandex, (5) Samsung-HSE, Laboratory)

TL;DR
This paper investigates the application of quantization techniques to GANs, demonstrating successful 4/8-bit weight and 8-bit activation quantization across multiple architectures without quality loss, enabling efficient inference on edge devices.
Contribution
It provides the first extensive experimental analysis of quantization methods on GANs, offering practical recipes for effective low-bit inference.
Findings
Quantization can be successfully applied to GANs with minimal quality loss.
Practical quantization recipes for StyleGAN, SAGAN, and CycleGAN are proposed.
GANs can be efficiently deployed on edge devices using 4/8-bit weights and 8-bit activations.
Abstract
Generative adversarial networks (GANs) have an enormous potential impact on digital content creation, e.g., photo-realistic digital avatars, semantic content editing, and quality enhancement of speech and images. However, the performance of modern GANs comes together with massive amounts of computations performed during the inference and high energy consumption. That complicates, or even makes impossible, their deployment on edge devices. The problem can be reduced with quantization -- a neural network compression technique that facilitates hardware-friendly inference by replacing floating-point computations with low-bit integer ones. While quantization is well established for discriminative models, the performance of modern quantization techniques in application to GANs remains unclear. GANs generate content of a more complex structure than discriminative models, and thus quantization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications · Music and Audio Processing
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Residual Connection · Cycle Consistency Loss · Residual Block · PatchGAN · Tanh Activation · Instance Normalization · Dense Connections · Convolution
