Training GANs with Optimism

Constantinos Daskalakis; Andrew Ilyas; Vasilis Syrgkanis; Haoyang Zeng

arXiv:1711.00141·cs.LG·February 14, 2018

TL;DR

This paper proposes Optimistic Mirror Descent (OMD) for training Wasserstein GANs to address limit cycling. It proves that the last iterate of OMD converges in bilinear zero-sum games, where gradient descent dynamics cycle, and reports empirical improvements over existing training algorithms.

Contribution

The paper proposes OMD for WGAN training, proves last-iterate convergence of OMD in bilinear zero-sum games, and introduces Optimistic Adam, an optimistic variant of Adam that shows empirical improvements over existing algorithms.
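This page names Optimistic Adam without giving its update rule. As a rough illustration only, the sketch below assumes the variant takes Adam's usual bias-corrected, normalized step and applies the optimistic correction to it: move twice along the current step and undo the previous one. The class name `OptimisticAdamSketch` and its default hyperparameters are hypothetical and are not taken from the official repository.

```python
import numpy as np


class OptimisticAdamSketch:
    """Illustrative optimistic variant of Adam (assumed form, not the official code)."""

    def __init__(self, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
        self.lr, self.beta1, self.beta2, self.eps = lr, beta1, beta2, eps
        self.m = 0.0          # running mean of gradients
        self.v = 0.0          # running mean of squared gradients
        self.t = 0            # number of updates performed so far
        self.prev_step = 0.0  # normalized step used in the previous update

    def update(self, theta, grad):
        """Return the new parameters after one update (descent direction)."""
        grad = np.asarray(grad, dtype=float)
        self.t += 1
        self.m = self.beta1 * self.m + (1.0 - self.beta1) * grad
        self.v = self.beta2 * self.v + (1.0 - self.beta2) * grad ** 2
        m_hat = self.m / (1.0 - self.beta1 ** self.t)  # bias correction
        v_hat = self.v / (1.0 - self.beta2 ** self.t)
        step = m_hat / (np.sqrt(v_hat) + self.eps)
        # Optimistic correction: twice the current step, minus the previous one.
        new_theta = theta - 2.0 * self.lr * step + self.lr * self.prev_step
        self.prev_step = step
        return new_theta
```

When consecutive normalized steps are equal, the correction cancels and the update reduces to an ordinary Adam step of size `lr`. The extra term only matters when the gradient direction changes, which is the situation that arises in cycling dynamics.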

Findings

01 OMD addresses limit cycling in WGAN training.

02 Models trained with OMD have smaller KL divergence.

03 Optimistic Adam improves CIFAR10 performance.

Abstract

We address the issue of limit cycling behavior in training Generative Adversarial Networks and propose the use of Optimistic Mirror Descent (OMD) for training Wasserstein GANs. Recent theoretical results have shown that optimistic mirror descent (OMD) can enjoy faster regret rates in the context of zero-sum games. Training WGANs is exactly a context of solving a zero-sum game with simultaneous no-regret dynamics. Moreover, we show that optimistic mirror descent addresses the limit cycling problem in training WGANs. We formally show that in the case of bi-linear zero-sum games the last iterate of OMD dynamics converges to an equilibrium, in contrast to GD dynamics which are bound to cycle. We also portray the huge qualitative difference between GD and OMD dynamics with toy examples, even when GD is modified with many adaptations proposed in the recent literature, such as gradient penalty or…
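The contrast between GD and OMD on a bilinear game can be checked numerically. The sketch below is a minimal toy, assuming the scalar game f(x, y) = x · y, where x minimizes and y maximizes, so the unique equilibrium is (0, 0). It assumes the optimistic update takes a double step along the current gradient and subtracts the previous one. The function name `final_distance`, the step size, and the starting point are illustrative choices, not taken from the paper's code.

```python
import numpy as np


def final_distance(optimistic, eta=0.1, steps=2000, x0=1.0, y0=1.0):
    """Run simultaneous updates on f(x, y) = x * y; return the distance to (0, 0)."""
    x, y = x0, y0
    # Seed the "previous" gradients with the initial ones, so that the first
    # optimistic update coincides with a plain gradient step.
    gx_prev, gy_prev = y0, x0
    for _ in range(steps):
        gx, gy = y, x  # df/dx = y and df/dy = x
        if optimistic:
            # Optimistic update: double step on the current gradient,
            # minus a step on the previous gradient.
            x_new = x - 2.0 * eta * gx + eta * gx_prev
            y_new = y + 2.0 * eta * gy - eta * gy_prev
        else:
            # Plain simultaneous gradient descent (x) and ascent (y).
            x_new = x - eta * gx
            y_new = y + eta * gy
        gx_prev, gy_prev = gx, gy
        x, y = x_new, y_new
    return float(np.hypot(x, y))


if __name__ == "__main__":
    print("GD  distance to equilibrium:", final_distance(optimistic=False))
    print("OMD distance to equilibrium:", final_distance(optimistic=True))
```

With these settings the plain GD iterate never settles at the equilibrium: in discrete time each step multiplies its distance from the origin by sqrt(1 + eta^2), so it spirals outward. The optimistic iterate contracts toward (0, 0) instead.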

Code & Models

Repositories

vsyrgkanis/optimistic_GAN_training (Official)

Taxonomy

Methods: Adam · Convolution · Wasserstein GAN