You Only Need Adversarial Supervision for Semantic Image Synthesis
Vadim Sushko, Edgar Sch\"onfeld, Dan Zhang, Juergen Gall, Bernt, Schiele, Anna Khoreva

TL;DR
This paper introduces a simplified GAN model for semantic image synthesis that relies solely on adversarial supervision, achieving higher quality, better alignment, and more diversity without perceptual loss.
Contribution
The authors redesign the discriminator as a semantic segmentation network and enable multi-modal synthesis using a 3D noise tensor, eliminating the need for perceptual loss.
Findings
Achieves 6 FID and 5 mIoU improvements over state-of-the-art.
Produces images with higher fidelity and better semantic alignment.
Generates more diverse images closely matching real image distributions.
Abstract
Despite their recent successes, GAN models for semantic image synthesis still suffer from poor image quality when trained with only adversarial supervision. Historically, additionally employing the VGG-based perceptual loss has helped to overcome this issue, significantly improving the synthesis quality, but at the same time limiting the progress of GAN models for semantic image synthesis. In this work, we propose a novel, simplified GAN model, which needs only adversarial supervision to achieve high quality results. We re-design the discriminator as a semantic segmentation network, directly using the given semantic label maps as the ground truth for training. By providing stronger supervision to the discriminator as well as to the generator through spatially- and semantically-aware discriminator feedback, we are able to synthesize images of higher fidelity with better alignment to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques · Advanced Vision and Imaging
MethodsOASIS
