Categorical SDEs with Simplex Diffusion
Pierre H. Richemond, Sander Dieleman, Arnaud Doucet

TL;DR
This paper introduces Simplex Diffusion, a novel method for diffusing categorical data on probability simplexes using a CIR process, extending diffusion models beyond continuous data.
Contribution
It proposes a new SDE framework for diffusing categorical data on simplexes, connecting it with the Dirichlet distribution and CIR processes, with insights on numerical implementation.
Findings
Relates Simplex Diffusion to Dirichlet distribution
Utilizes CIR process for diffusion on simplexes
Discusses numerical implementation and limitations
Abstract
Diffusion models typically operate in the standard framework of generative modelling by producing continuously-valued datapoints. To this end, they rely on a progressive Gaussian smoothing of the original data distribution, which admits an SDE interpretation involving increments of a standard Brownian motion. However, some applications such as text generation or reinforcement learning might naturally be better served by diffusing categorical-valued data, i.e., lifting the diffusion to a space of probability distributions. To this end, this short theoretical note proposes Simplex Diffusion, a means to directly diffuse datapoints located on an n-dimensional probability simplex. We show how this relates to the Dirichlet distribution on the simplex and how the analogous SDE is realized thanks to a multi-dimensional Cox-Ingersoll-Ross process (abbreviated as CIR), previously used in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Systems and Time Series Analysis
MethodsDiffusion
