On the Effects of Batch and Weight Normalization in Generative Adversarial Networks
Sitao Xiang, Hao Li

TL;DR
This paper introduces weight normalization (WN) for GAN training, demonstrating it improves stability, efficiency, and sample quality over batch normalization (BN), supported by a new evaluation metric and extensive experiments.
Contribution
The paper proposes a novel weight normalization method for GANs, providing a systematic evaluation framework and showing superior performance and stability compared to batch normalization.
Findings
WN reduces mean squared reconstruction error by 10%
WN achieves more stable training on deep ResNet architectures
WGAN produces higher quality samples than BN-based GANs
Abstract
Generative adversarial networks (GANs) are highly effective unsupervised learning frameworks that can generate very sharp data, even for data such as images with complex, highly multimodal distributions. However GANs are known to be very hard to train, suffering from problems such as mode collapse and disturbing visual artifacts. Batch normalization (BN) techniques have been introduced to address the training. Though BN accelerates the training in the beginning, our experiments show that the use of BN can be unstable and negatively impact the quality of the trained model. The evaluation of BN and numerous other recent schemes for improving GAN training is hindered by the lack of an effective objective quality measure for GAN models. To address these issues, we first introduce a weight normalization (WN) approach for GAN training that significantly improves the stability, efficiency and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques · Digital Media Forensic Detection
MethodsAverage Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling · Residual Connection · Convolution
