On the Effects of Batch and Weight Normalization in Generative   Adversarial Networks

Sitao Xiang; Hao Li

arXiv:1704.03971·stat.ML·December 5, 2017·70 cites

On the Effects of Batch and Weight Normalization in Generative Adversarial Networks

Sitao Xiang, Hao Li

PDF

Open Access 3 Repos

TL;DR

This paper introduces weight normalization (WN) for GAN training, demonstrating it improves stability, efficiency, and sample quality over batch normalization (BN), supported by a new evaluation metric and extensive experiments.

Contribution

The paper proposes a novel weight normalization method for GANs, providing a systematic evaluation framework and showing superior performance and stability compared to batch normalization.

Findings

01

WN reduces mean squared reconstruction error by 10%

02

WN achieves more stable training on deep ResNet architectures

03

WGAN produces higher quality samples than BN-based GANs

Abstract

Generative adversarial networks (GANs) are highly effective unsupervised learning frameworks that can generate very sharp data, even for data such as images with complex, highly multimodal distributions. However GANs are known to be very hard to train, suffering from problems such as mode collapse and disturbing visual artifacts. Batch normalization (BN) techniques have been introduced to address the training. Though BN accelerates the training in the beginning, our experiments show that the use of BN can be unstable and negatively impact the quality of the trained model. The evaluation of BN and numerous other recent schemes for improving GAN training is hindered by the lack of an effective objective quality measure for GAN models. To address these issues, we first introduce a weight normalization (WN) approach for GAN training that significantly improves the stability, efficiency and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques · Digital Media Forensic Detection

MethodsAverage Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling · Residual Connection · Convolution