Channel-Recurrent Autoencoding for Image Modeling
Wenling Shang, Kihyuk Sohn, Yuandong Tian

TL;DR
This paper introduces a channel-recurrent VAE-GAN model that captures complex image features more effectively, producing diverse high-resolution images with interpretable latent spaces and improved training regularizations.
Contribution
It proposes a novel channel-recurrent architecture integrated with adversarial training and new regularizations, advancing image modeling capabilities over existing VAEs and VAE-GANs.
Findings
Outperforms VAE-GAN in generating diverse high-resolution images
Produces interpretable and expressive latent representations
Enhances training with novel regularization techniques
Abstract
Despite recent successes in synthesizing faces and bedrooms, existing generative models struggle to capture more complex image types, potentially due to the oversimplification of their latent space constructions. To tackle this issue, building on Variational Autoencoders (VAEs), we integrate recurrent connections across channels to both inference and generation steps, allowing the high-level features to be captured in global-to-local, coarse-to-fine manners. Combined with adversarial loss, our channel-recurrent VAE-GAN (crVAE-GAN) outperforms VAE-GAN in generating a diverse spectrum of high resolution images while maintaining the same level of computational efficacy. Our model produces interpretable and expressive latent representations to benefit downstream tasks such as image completion. Moreover, we propose two novel regularizations, namely the KL objective weighting scheme over time…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Face recognition and analysis · Digital Media Forensic Detection
