Structured Denoising Diffusion Models in Discrete State-Spaces

Jacob Austin; Daniel D. Johnson; Jonathan Ho; Daniel Tarlow; Rianne; van den Berg

arXiv:2107.03006·cs.LG·February 23, 2023·82 cites

Structured Denoising Diffusion Models in Discrete State-Spaces

Jacob Austin, Daniel D. Johnson, Jonathan Ho, Daniel Tarlow, Rianne, van den Berg

PDF

Open Access 5 Repos 2 Models 1 Video

TL;DR

This paper introduces Discrete Denoising Diffusion Probabilistic Models (D3PMs), extending diffusion models to discrete data and demonstrating improved image and text generation results over existing methods.

Contribution

The paper generalizes diffusion models to discrete spaces with novel transition matrices and a new loss function, connecting diffusion with autoregressive and mask-based models.

Findings

01

Achieves strong character-level text generation results.

02

Surpasses continuous DDPM in CIFAR-10 sample quality.

03

Improves log-likelihood on CIFAR-10.

Abstract

Denoising diffusion probabilistic models (DDPMs) (Ho et al. 2020) have shown impressive results on image and waveform generation in continuous state spaces. Here, we introduce Discrete Denoising Diffusion Probabilistic Models (D3PMs), diffusion-like generative models for discrete data that generalize the multinomial diffusion model of Hoogeboom et al. 2021, by going beyond corruption processes with uniform transition probabilities. This includes corruption with transition matrices that mimic Gaussian kernels in continuous space, matrices based on nearest neighbors in embedding space, and matrices that introduce absorbing states. The third allows us to draw a connection between diffusion models and autoregressive and mask-based generative models. We show that the choice of transition matrix is an important design decision that leads to improved results in image and text domains. We also…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Videos

Structured Denoising Diffusion Models in Discrete State-Spaces· slideslive

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Music and Audio Processing · Topic Modeling

MethodsDiffusion