Variation-Aware Semantic Image Synthesis

Mingle Xu; Jaehwan Lee; Sook Yoon; Hyongsuk Kim; Dong Sun; Park

arXiv:2301.10551·cs.CV·January 23, 2024

Variation-Aware Semantic Image Synthesis

Mingle Xu, Jaehwan Lee, Sook Yoon, Hyongsuk Kim, Dong Sun, Park

PDF

Open Access

TL;DR

This paper introduces a variation-aware approach to semantic image synthesis that enhances intra-class diversity, resulting in more natural and photorealistic images, by incorporating semantic noise and position codes.

Contribution

It proposes simple methods to improve intra-class variation in semantic image synthesis, addressing a key limitation of current algorithms.

Findings

01

Enhanced intra-class variation leads to more natural images.

02

Achieved slightly better FID and mIoU scores.

03

Compatible with state-of-the-art algorithms.

Abstract

Semantic image synthesis (SIS) aims to produce photorealistic images aligning to given conditional semantic layout and has witnessed a significant improvement in recent years. Although the diversity in image-level has been discussed heavily, class-level mode collapse widely exists in current algorithms. Therefore, we declare a new requirement for SIS to achieve more photorealistic images, variation-aware, which consists of inter- and intra-class variation. The inter-class variation is the diversity between different semantic classes while the intra-class variation stresses the diversity inside one class. Through analysis, we find that current algorithms elusively embrace the inter-class variation but the intra-class variation is still not enough. Further, we introduce two simple methods to achieve variation-aware semantic image synthesis (VASIS) with a higher intra-class variation,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Advanced Image and Video Retrieval Techniques · Generative Adversarial Networks and Image Synthesis