Diverse Semantic Image Synthesis via Probability Distribution Modeling
Zhentao Tan, Menglei Chai, Dongdong Chen, Jing Liao, Qi, Chu, Bin Liu, Gang Hua, Nenghai Yu

TL;DR
This paper introduces a novel framework for diverse semantic image synthesis that models class distributions as continuous probabilities, enabling multimodal and instance-level diverse image generation with improved control and quality.
Contribution
It proposes modeling class-level modulation parameters as continuous distributions and introduces instance-adaptive stochastic sampling and prior noise remapping for enhanced diversity and style control.
Findings
Achieves superior diversity compared to state-of-the-art methods
Maintains comparable image quality
Supports exemplar-based style control
Abstract
Semantic image synthesis, translating semantic layouts to photo-realistic images, is a one-to-many mapping problem. Though impressive progress has been recently made, diverse semantic synthesis that can efficiently produce semantic-level multimodal results, still remains a challenge. In this paper, we propose a novel diverse semantic image synthesis framework from the perspective of semantic class distributions, which naturally supports diverse generation at semantic or even instance level. We achieve this by modeling class-level conditional modulation parameters as continuous probability distributions instead of discrete values, and sampling per-instance modulation parameters through instance-adaptive stochastic sampling that is consistent across the network. Moreover, we propose prior noise remapping, through linear perturbation parameters encoded from paired references, to facilitate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Image Enhancement Techniques · Advanced Vision and Imaging
