DuDGAN: Improving Class-Conditional GANs via Dual-Diffusion
Taesun Yeom, Minhyeok Lee

TL;DR
DuDGAN introduces a dual diffusion-based noise injection technique in GANs, enhancing class-conditional image generation by addressing mode collapse and training instability, and outperforming existing models on multiple datasets.
Contribution
The paper proposes DuDGAN, a novel class-conditional GAN framework utilizing dual diffusion noise injection, which improves training stability and sample quality over prior methods.
Findings
Outperforms state-of-the-art models on AFHQ, Food-101, and CIFAR-10 datasets.
Achieves superior FID, KID, Precision, and Recall scores.
Effectively mitigates mode collapse and training instability.
Abstract
Class-conditional image generation using generative adversarial networks (GANs) has been investigated through various techniques; however, it continues to face challenges such as mode collapse, training instability, and low-quality output in cases of datasets with high intra-class variation. Furthermore, most GANs often converge in larger iterations, resulting in poor iteration efficacy in training procedures. While Diffusion-GAN has shown potential in generating realistic samples, it has a critical limitation in generating class-conditional samples. To overcome these limitations, we propose a novel approach for class-conditional image generation using GANs called DuDGAN, which incorporates a dual diffusion-based noise injection process. Our method consists of three unique networks: a discriminator, a generator, and a classifier. During the training process, Gaussian-mixture noises are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Image Processing Techniques and Applications · Digital Media Forensic Detection
