Efficient Semantic Image Synthesis via Class-Adaptive Normalization
Zhentao Tan, Dongdong Chen, Qi Chu, Menglei Chai, Jing, Liao, Mingming He, Lu Yuan, Gang Hua, Nenghai Yu

TL;DR
This paper introduces CLADE, a lightweight normalization method for semantic image synthesis that maintains high quality while reducing computational overhead compared to SPADE, by focusing on semantic class-awareness.
Contribution
The paper proposes CLADE, a novel class-adaptive normalization technique that simplifies spatial adaptiveness, and introduces CLADE-ICPE with intra-class positional encoding for improved spatial modulation.
Findings
CLADE achieves comparable quality to SPADE with fewer parameters.
CLADE-ICPE further enhances spatial adaptiveness.
The methods are validated on multiple challenging datasets.
Abstract
Spatially-adaptive normalization (SPADE) is remarkably successful recently in conditional semantic image synthesis \cite{park2019semantic}, which modulates the normalized activation with spatially-varying transformations learned from semantic layouts, to prevent the semantic information from being washed away. Despite its impressive performance, a more thorough understanding of the advantages inside the box is still highly demanded to help reduce the significant computation and parameter overhead introduced by this novel structure. In this paper, from a return-on-investment point of view, we conduct an in-depth analysis of the effectiveness of this spatially-adaptive normalization and observe that its modulation parameters benefit more from semantic-awareness rather than spatial-adaptiveness, especially for high-resolution input masks. Inspired by this observation, we propose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications · Advanced Vision and Imaging
MethodsSpatially-Adaptive Normalization · High-resolution input
