Hierarchical Autoregressive Image Models with Auxiliary Decoders

Jeffrey De Fauw; Sander Dieleman; Karen Simonyan

arXiv:1903.04933·cs.CV·October 9, 2019·26 cites

Hierarchical Autoregressive Image Models with Auxiliary Decoders

Jeffrey De Fauw, Sander Dieleman, Karen Simonyan

PDF

Open Access

TL;DR

This paper introduces hierarchical autoregressive image models with auxiliary decoders that learn abstract representations, enabling the generation of high-resolution, large-scale coherent images and outperforming existing models in realism.

Contribution

It proposes a novel hierarchical approach with auxiliary decoders to improve large-scale coherence in autoregressive image generation.

Findings

01

Models generate realistic 128x128 and 256x256 images.

02

Hierarchical models outperform non-hierarchical counterparts.

03

Human evaluation favors the proposed models over state-of-the-art alternatives.

Abstract

Autoregressive generative models of images tend to be biased towards capturing local structure, and as a result they often produce samples which are lacking in terms of large-scale coherence. To address this, we propose two methods to learn discrete representations of images which abstract away local detail. We show that autoregressive models conditioned on these representations can produce high-fidelity reconstructions of images, and that we can train autoregressive priors on these representations that produce samples with large-scale coherence. We can recursively apply the learning procedure, yielding a hierarchy of progressively more abstract image representations. We train hierarchical class-conditional autoregressive models on the ImageNet dataset and demonstrate that they are able to generate realistic images at resolutions of 128 $\times$ 128 and 256 $\times$ 256 pixels. We also…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Chaos-based Image/Signal Encryption · Advanced Image Processing Techniques