Semantic Hierarchy Emerges in Deep Generative Representations for Scene Synthesis
Ceyuan Yang, Yujun Shen, Bolei Zhou

TL;DR
This paper reveals that deep generative models like GANs develop a structured semantic hierarchy across layers, enabling understanding and manipulation of scene components in synthesized images.
Contribution
It uncovers the layered semantic hierarchy in GAN representations and quantifies the causal relationship between activations and image semantics.
Findings
Layer-wise representations encode hierarchical scene semantics.
Early layers determine spatial layout and configuration.
Later layers control scene attributes and colors.
Abstract
Despite the success of Generative Adversarial Networks (GANs) in image synthesis, there lacks enough understanding on what generative models have learned inside the deep generative representations and how photo-realistic images are able to be composed of the layer-wise stochasticity introduced in recent GANs. In this work, we show that highly-structured semantic hierarchy emerges as variation factors from synthesizing scenes from the generative representations in state-of-the-art GAN models, like StyleGAN and BigGAN. By probing the layer-wise representations with a broad set of semantics at different abstraction levels, we are able to quantify the causality between the activations and semantics occurring in the output image. Such a quantification identifies the human-understandable variation factors learned by GANs to compose scenes. The qualitative and quantitative results further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Aesthetic Perception and Analysis · Computer Graphics and Visualization Techniques
MethodsSoftmax · *Communicated@Fast*How Do I Communicate to Expedia? · Conditional Batch Normalization · Residual Block · Two Time-scale Update Rule · GAN Hinge Loss · Residual Connection · Non-Local Operation · Non-Local Block · Truncation Trick
