Fantastic Style Channels and Where to Find Them: A Submodular Framework   for Discovering Diverse Directions in GANs

Enis Simsar; Umut Kocasari; Ezgi G\"ulperi Er; Pinar Yanardag

arXiv:2203.08516·cs.CV·April 1, 2022

Fantastic Style Channels and Where to Find Them: A Submodular Framework for Discovering Diverse Directions in GANs

Enis Simsar, Umut Kocasari, Ezgi G\"ulperi Er, Pinar Yanardag

PDF

TL;DR

This paper introduces a submodular optimization framework to discover a diverse set of interpretable directions in the StyleGAN2 latent space, enhancing controllability and disentanglement in image manipulation tasks.

Contribution

We propose a novel submodular approach that efficiently finds diverse, representative directions in StyleGAN2's style space, improving over existing methods that find limited directions.

Findings

01

Finds more diverse manipulation directions

02

Produces more disentangled latent directions

03

Outperforms prior methods in qualitative and quantitative evaluations

Abstract

The discovery of interpretable directions in the latent spaces of pre-trained GAN models has recently become a popular topic. In particular, StyleGAN2 has enabled various image generation and manipulation tasks due to its rich and disentangled latent spaces. The discovery of such directions is typically done either in a supervised manner, which requires annotated data for each desired manipulation or in an unsupervised manner, which requires a manual effort to identify the directions. As a result, existing work typically finds only a handful of directions in which controllable edits can be made. In this study, we design a novel submodular framework that finds the most representative and diverse subset of directions in the latent space of StyleGAN2. Our approach takes advantage of the latent space of channel-wise style parameters, so-called style space, in which we cluster channels that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsPath Length Regularization · HuMan(Expedia)||How do I get a human at Expedia? · R1 Regularization · Convolution · Weight Demodulation