Swapping Autoencoder for Deep Image Manipulation

Taesung Park; Jun-Yan Zhu; Oliver Wang; Jingwan Lu; Eli Shechtman,; Alexei A. Efros; Richard Zhang

arXiv:2007.00653·cs.CV·December 15, 2020·145 cites

Swapping Autoencoder for Deep Image Manipulation

Taesung Park, Jun-Yan Zhu, Oliver Wang, Jingwan Lu, Eli Shechtman,, Alexei A. Efros, Richard Zhang

PDF

Open Access 4 Repos 1 Video

TL;DR

The paper introduces the Swapping Autoencoder, a deep model that enables realistic and controllable image manipulation by encoding images into structure and texture components, facilitating various editing tasks.

Contribution

It presents a novel autoencoder architecture that separates structure and texture for effective image manipulation, improving over existing generative models in quality and efficiency.

Findings

01

Produces better image manipulation results than recent models.

02

Enables real-time editing of input images.

03

Efficiently encodes images for easy manipulation.

Abstract

Deep generative models have become increasingly effective at producing realistic images from randomly sampled seeds, but using such models for controllable manipulation of existing images remains challenging. We propose the Swapping Autoencoder, a deep model designed specifically for image manipulation, rather than random sampling. The key idea is to encode an image with two independent components and enforce that any swapped combination maps to a realistic image. In particular, we encourage the components to represent structure and texture, by enforcing one component to encode co-occurrent patch statistics across different parts of an image. As our method is trained with an encoder, finding the latent codes for a new input image becomes trivial, rather than cumbersome. As a result, it can be used to manipulate real input images in various ways, including texture swapping, local and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Swapping Autoencoder for Deep Image Manipulation· slideslive

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Computer Graphics and Visualization Techniques

MethodsHuMan(Expedia)||How do I get a human at Expedia? · R1 Regularization · Convolution · Weight Demodulation · Path Length Regularization · StyleGAN2