Quantization of Generative Adversarial Networks for Efficient Inference:   a Methodological Study

Pavel Andreev (1; 2; 3); Alexander Fritzler (1; 2; 4); Dmitry Vetrov; (1; 3; 5) ((1) Higher School of Economics; (2) Skolkovo Institute of Science; and Technology; (3) Samsung AI Center Moscow; (4) Yandex; (5) Samsung-HSE; Laboratory)

arXiv:2108.13996·cs.AI·September 1, 2021

Quantization of Generative Adversarial Networks for Efficient Inference: a Methodological Study

Pavel Andreev (1, 2, 3), Alexander Fritzler (1, 2, 4), Dmitry Vetrov, (1, 3, 5) ((1) Higher School of Economics, (2) Skolkovo Institute of Science, and Technology, (3) Samsung AI Center Moscow, (4) Yandex, (5) Samsung-HSE, Laboratory)

PDF

Open Access

TL;DR

This paper investigates the application of quantization techniques to GANs, demonstrating successful 4/8-bit weight and 8-bit activation quantization across multiple architectures without quality loss, enabling efficient inference on edge devices.

Contribution

It provides the first extensive experimental analysis of quantization methods on GANs, offering practical recipes for effective low-bit inference.

Findings

01

Quantization can be successfully applied to GANs with minimal quality loss.

02

Practical quantization recipes for StyleGAN, SAGAN, and CycleGAN are proposed.

03

GANs can be efficiently deployed on edge devices using 4/8-bit weights and 8-bit activations.

Abstract

Generative adversarial networks (GANs) have an enormous potential impact on digital content creation, e.g., photo-realistic digital avatars, semantic content editing, and quality enhancement of speech and images. However, the performance of modern GANs comes together with massive amounts of computations performed during the inference and high energy consumption. That complicates, or even makes impossible, their deployment on edge devices. The problem can be reduced with quantization -- a neural network compression technique that facilitates hardware-friendly inference by replacing floating-point computations with low-bit integer ones. While quantization is well established for discriminative models, the performance of modern quantization techniques in application to GANs remains unclear. GANs generate content of a more complex structure than discriminative models, and thus quantization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications · Music and Audio Processing

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Residual Connection · Cycle Consistency Loss · Residual Block · PatchGAN · Tanh Activation · Instance Normalization · Dense Connections · Convolution