Generating High Fidelity Images with Subscale Pixel Networks and Multidimensional Upscaling
Jacob Menick, Nal Kalchbrenner

TL;DR
This paper introduces Subscale Pixel Networks and Multidimensional Upscaling to generate high fidelity large images efficiently, achieving state-of-the-art results on CelebA-HQ and ImageNet datasets.
Contribution
The paper presents a novel Subscale Pixel Network architecture and a Multidimensional Upscaling method for high fidelity image generation at large scales.
Findings
Achieved state-of-the-art likelihood results on CelebA-HQ and ImageNet.
Generated high fidelity large-scale images with improved efficiency.
Set new benchmarks in image generation quality.
Abstract
The unconditional generation of high fidelity images is a longstanding benchmark for testing the performance of image decoders. Autoregressive image models have been able to generate small images unconditionally, but the extension of these methods to large images where fidelity can be more readily assessed has remained an open problem. Among the major challenges are the capacity to encode the vast previous context and the sheer difficulty of learning a distribution that preserves both global semantic coherence and exactness of detail. To address the former challenge, we propose the Subscale Pixel Network (SPN), a conditional decoder architecture that generates an image as a sequence of sub-images of equal size. The SPN compactly captures image-wide spatial dependencies and requires a fraction of the memory and the computation required by other fully autoregressive models. To address the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCell Image Analysis Techniques · Generative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques
