Spatial Steerability of GANs via Self-Supervision from Discriminator

Jianyuan Wang; Lalit Bhagat; Ceyuan Yang; Yinghao Xu; Yujun Shen,; Hongdong Li; Bolei Zhou

arXiv:2301.08455·cs.CV·January 11, 2024

Spatial Steerability of GANs via Self-Supervision from Discriminator

Jianyuan Wang, Lalit Bhagat, Ceyuan Yang, Yinghao Xu, Yujun Shen,, Hongdong Li, Bolei Zhou

PDF

Open Access

TL;DR

This paper introduces a self-supervised method that enhances the spatial controllability of GANs by encoding Gaussian heatmaps into the model, allowing intuitive scene layout edits without extra annotations.

Contribution

It proposes a novel self-supervised approach that aligns heatmaps with discriminator attention, enabling spatial editing and improving synthesis quality without additional annotations.

Findings

01

Enables spatial editing of various scenes and objects.

02

Improves the quality of generated images.

03

Supports intuitive, fine-grained scene manipulation.

Abstract

Generative models make huge progress to the photorealistic image synthesis in recent years. To enable human to steer the image generation process and customize the output, many works explore the interpretable dimensions of the latent space in GANs. Existing methods edit the attributes of the output image such as orientation or color scheme by varying the latent code along certain directions. However, these methods usually require additional human annotations for each pretrained model, and they mostly focus on editing global attributes. In this work, we propose a self-supervised approach to improve the spatial steerability of GANs without searching for steerable directions in the latent space or requiring extra annotations. Specifically, we design randomly sampled Gaussian heatmaps to be encoded into the intermediate layers of generative models as spatial inductive bias. Along with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Visual Attention and Saliency Detection