Deep Automodulators

Ari Heljakka; Yuxin Hou; Juho Kannala; Arno Solin

arXiv:1912.10321·cs.LG·October 30, 2020

Deep Automodulators

Ari Heljakka, Yuxin Hou, Juho Kannala, Arno Solin

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper introduces automodulators, a new type of generative autoencoder capable of high-quality image reconstruction and style-mixing, with novel training techniques demonstrated on multiple datasets.

Contribution

It presents the automodulator architecture, enabling faithful image reproduction and style-mixing, along with a training method for high-resolution, sharp outputs in autoencoders.

Findings

01

State-of-the-art autoencoder performance

02

High-resolution, sharp image outputs

03

Effective style-mixing capabilities

Abstract

We introduce a new category of generative autoencoders called automodulators. These networks can faithfully reproduce individual real-world input images like regular autoencoders, but also generate a fused sample from an arbitrary combination of several such images, allowing instantaneous 'style-mixing' and other new applications. An automodulator decouples the data flow of decoder operations from statistical properties thereof and uses the latent vector to modulate the former by the latter, with a principled approach for mutual disentanglement of decoder layers. Prior work has explored similar decoder architecture with GANs, but their focus has been on random sampling. A corresponding autoencoder could operate on real input images. For the first time, we show how to train such a general-purpose model with sharp outputs in high resolution, using novel training techniques, demonstrated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Deep Automodulators· slideslive

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Music and Audio Processing · Digital Media Forensic Detection

MethodsSolana Customer Service Number +1-833-534-1729