PixelSNAIL: An Improved Autoregressive Generative Model

Xi Chen; Nikhil Mishra; Mostafa Rohaninejad; Pieter Abbeel

arXiv:1712.09763·cs.LG·December 29, 2017·54 cites

PixelSNAIL: An Improved Autoregressive Generative Model

Xi Chen, Nikhil Mishra, Mostafa Rohaninejad, Pieter Abbeel

PDF

Open Access 5 Repos

TL;DR

PixelSNAIL introduces a novel autoregressive generative model combining causal convolutions with self-attention, achieving state-of-the-art density estimation results on image datasets like CIFAR-10 and ImageNet.

Contribution

The paper presents a new architecture that enhances autoregressive models with self-attention, improving long-range dependency modeling in image density estimation.

Findings

01

Achieved 2.85 bits per dim on CIFAR-10

02

Achieved 3.80 bits per dim on 32x32 ImageNet

03

Outperformed previous state-of-the-art models

Abstract

Autoregressive generative models consistently achieve the best results in density estimation tasks involving high dimensional data, such as images or audio. They pose density estimation as a sequence modeling task, where a recurrent neural network (RNN) models the conditional distribution over the next element conditioned on all previous elements. In this paradigm, the bottleneck is the extent to which the RNN can model long-range dependencies, and the most successful approaches rely on causal convolutions, which offer better access to earlier parts of the sequence than conventional RNNs. Taking inspiration from recent work in meta reinforcement learning, where dealing with long-range dependencies is also essential, we introduce a new generative model architecture that combines causal convolutions with self attention. In this note, we describe the resulting model and present…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Reinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning