Autoregressive Diffusion Models
Emiel Hoogeboom, Alexey A. Gritsenko, Jasmijn Bastings, Ben, Poole, Rianne van den Berg, Tim Salimans

TL;DR
Autoregressive Diffusion Models (ARDMs) unify and extend existing autoregressive and diffusion models, offering efficient training, parallel generation, and superior performance in data compression tasks, including single data point compression.
Contribution
This paper introduces ARDMs, a new class of models that generalize previous models, enabling scalable training, parallel sampling, and effective lossless compression.
Findings
ARDMs require fewer steps than discrete diffusion models for similar performance.
ARDMs support parallel generation adaptable to various computational budgets.
ARDMs achieve strong results in lossless compression, including single data point scenarios.
Abstract
We introduce Autoregressive Diffusion Models (ARDMs), a model class encompassing and generalizing order-agnostic autoregressive models (Uria et al., 2014) and absorbing discrete diffusion (Austin et al., 2021), which we show are special cases of ARDMs under mild assumptions. ARDMs are simple to implement and easy to train. Unlike standard ARMs, they do not require causal masking of model representations, and can be trained using an efficient objective similar to modern probabilistic diffusion models that scales favourably to highly-dimensional data. At test time, ARDMs support parallel generation which can be adapted to fit any given generation budget. We find that ARDMs require significantly fewer steps than discrete diffusion models to attain the same performance. Finally, we apply ARDMs to lossless compression, and show that they are uniquely suited to this task. Contrary to existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsModel Reduction and Neural Networks · Fractional Differential Equations Solutions · Advanced Neuroimaging Techniques and Applications
MethodsTest · Diffusion
