Interpreting Generative Adversarial Networks for Interactive Image   Generation

Bolei Zhou

arXiv:2108.04896·cs.CV·February 3, 2022·1 cites

Interpreting Generative Adversarial Networks for Interactive Image Generation

Bolei Zhou

PDF

Open Access

TL;DR

This paper reviews recent methods for interpreting GANs, focusing on understanding how they generate realistic images from random vectors and enabling interactive image editing.

Contribution

It categorizes interpretation techniques into supervised, unsupervised, and embedding-guided approaches, highlighting how human-understandable concepts emerge in GAN representations.

Findings

01

Interpretation methods help understand GAN image generation.

02

Human-understandable concepts can be identified in GAN representations.

03

These concepts enable interactive image editing.

Abstract

Significant progress has been made by the advances in Generative Adversarial Networks (GANs) for image generation. However, there lacks enough understanding of how a realistic image is generated by the deep representations of GANs from a random vector. This chapter gives a summary of recent works on interpreting deep generative models. The methods are categorized into the supervised, the unsupervised, and the embedding-guided approaches. We will see how the human-understandable concepts that emerge in the learned representation can be identified and used for interactive image generation and editing.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Aesthetic Perception and Analysis