Variation-Aware Semantic Image Synthesis
Mingle Xu, Jaehwan Lee, Sook Yoon, Hyongsuk Kim, Dong Sun, Park

TL;DR
This paper introduces a variation-aware approach to semantic image synthesis that enhances intra-class diversity, resulting in more natural and photorealistic images, by incorporating semantic noise and position codes.
Contribution
It proposes simple methods to improve intra-class variation in semantic image synthesis, addressing a key limitation of current algorithms.
Findings
Enhanced intra-class variation leads to more natural images.
Achieved slightly better FID and mIoU scores.
Compatible with state-of-the-art algorithms.
Abstract
Semantic image synthesis (SIS) aims to produce photorealistic images aligning to given conditional semantic layout and has witnessed a significant improvement in recent years. Although the diversity in image-level has been discussed heavily, class-level mode collapse widely exists in current algorithms. Therefore, we declare a new requirement for SIS to achieve more photorealistic images, variation-aware, which consists of inter- and intra-class variation. The inter-class variation is the diversity between different semantic classes while the intra-class variation stresses the diversity inside one class. Through analysis, we find that current algorithms elusively embrace the inter-class variation but the intra-class variation is still not enough. Further, we introduce two simple methods to achieve variation-aware semantic image synthesis (VASIS) with a higher intra-class variation,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image and Video Retrieval Techniques · Generative Adversarial Networks and Image Synthesis
