Stable Parallel Training of Wasserstein Conditional Generative Adversarial Neural Networks
Massimiliano Lupo Pasini, Junqi Yin

TL;DR
This paper introduces a stable, scalable parallel training method for Wasserstein Conditional GANs that avoids inter-process communication, reduces mode collapse, and improves image quality on large datasets using extensive GPU resources.
Contribution
It presents a novel parallel training approach for W-CGANs that enhances stability and scalability without inter-process communication, outperforming previous methods.
Findings
Improved inception scores and Frechet distances over previous methods.
Achieved weak scaling on up to 2,000 GPUs.
Enhanced image quality in generated samples.
Abstract
We propose a stable, parallel approach to train Wasserstein Conditional Generative Adversarial Neural Networks (W-CGANs) under the constraint of a fixed computational budget. Differently from previous distributed GANs training techniques, our approach avoids inter-process communications, reduces the risk of mode collapse and enhances scalability by using multiple generators, each one of them concurrently trained on a single data label. The use of the Wasserstein metric also reduces the risk of cycling by stabilizing the training of each generator. We illustrate the approach on the CIFAR10, CIFAR100, and ImageNet1k datasets, three standard benchmark image datasets, maintaining the original resolution of the images for each dataset. Performance is assessed in terms of scalability and final accuracy within a limited fixed computational time and computational resources. To measure accuracy,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications · Advanced Image Processing Techniques
