Improved Training of Wasserstein GANs

Ishaan Gulrajani; Faruk Ahmed; Martin Arjovsky; Vincent Dumoulin,; Aaron Courville

arXiv:1704.00028·cs.LG·December 27, 2017·1.5k cites

Improved Training of Wasserstein GANs

Ishaan Gulrajani, Faruk Ahmed, Martin Arjovsky, Vincent Dumoulin,, Aaron Courville

PDF

Open Access 5 Repos 2 Models

TL;DR

This paper introduces a gradient penalty method for training Wasserstein GANs, improving stability and sample quality over traditional weight clipping approaches, and enabling effective training of deep and diverse models.

Contribution

It proposes a gradient penalty technique as an alternative to weight clipping, enhancing the stability and quality of Wasserstein GAN training.

Findings

01

Gradient penalty outperforms weight clipping in WGANs.

02

Stable training achieved across various architectures including deep ResNets.

03

High-quality samples generated on CIFAR-10 and LSUN datasets.

Abstract

Generative Adversarial Networks (GANs) are powerful generative models, but suffer from training instability. The recently proposed Wasserstein GAN (WGAN) makes progress toward stable training of GANs, but sometimes can still generate only low-quality samples or fail to converge. We find that these problems are often due to the use of weight clipping in WGAN to enforce a Lipschitz constraint on the critic, which can lead to undesired behavior. We propose an alternative to clipping weights: penalize the norm of gradient of the critic with respect to its input. Our proposed method performs better than standard WGAN and enables stable training of a wide variety of GAN architectures with almost no hyperparameter tuning, including 101-layer ResNets and language models over discrete data. We also achieve high quality generations on CIFAR-10 and LSUN bedrooms.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications · Adversarial Robustness in Machine Learning

MethodsResidual Connection · Average Pooling · 1x1 Convolution · Layer Normalization · Max Pooling · Global Average Pooling · Bottleneck Residual Block · Residual Block · Kaiming Initialization · Bitcoin Customer Service Number +1-833-534-1729