Diagonal Attention and Style-based GAN for Content-Style Disentanglement   in Image Generation and Translation

Gihyun Kwon; Jong Chul Ye

arXiv:2103.16146·cs.CV·July 26, 2021

Diagonal Attention and Style-based GAN for Content-Style Disentanglement in Image Generation and Translation

Gihyun Kwon, Jong Chul Ye

PDF

1 Repo

TL;DR

This paper introduces a hierarchical attention-based method for disentangling content and style in image generation, enabling more precise control over spatial features and improving over existing models in multiple datasets.

Contribution

The paper proposes a novel hierarchical adaptive Diagonal spatial Attention (DAT) layer combined with AdaIN for effective content-style disentanglement in GANs.

Findings

01

Outperforms existing models in disentanglement scores

02

Provides more flexible spatial control in generated images

03

Easily integrates into GAN inversion frameworks

Abstract

One of the important research topics in image generative models is to disentangle the spatial contents and styles for their separate control. Although StyleGAN can generate content feature vectors from random noises, the resulting spatial content control is primarily intended for minor spatial variations, and the disentanglement of global content and styles is by no means complete. Inspired by a mathematical understanding of normalization and attention, here we present a novel hierarchical adaptive Diagonal spatial ATtention (DAT) layers to separately manipulate the spatial contents from styles in a hierarchical manner. Using DAT and AdaIN, our method enables coarse-to-fine level disentanglement of spatial contents and styles. In addition, our generator can be easily integrated into the GAN inversion framework so that the content and style of translated images from multi-domain image…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

KwonGihyun/DiagonalGAN
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsHuMan(Expedia)||How do I get a human at Expedia? · R1 Regularization · Adaptive Instance Normalization · Dense Connections · Convolution · Feedforward Network · StyleGAN