Foundations of Diffusion Models in General State Spaces: A Self-Contained Introduction

Vincent Pauline; Tobias H\"oppe; Kirill Neklyudov; Alexander Tong; Stefan Bauer; Andrea Dittadi

arXiv:2512.05092·stat.ML·December 5, 2025

Foundations of Diffusion Models in General State Spaces: A Self-Contained Introduction

Vincent Pauline, Tobias H\"oppe, Kirill Neklyudov, Alexander Tong, Stefan Bauer, Andrea Dittadi

PDF

Open Access

TL;DR

This paper provides a comprehensive, self-contained introduction to diffusion models across both continuous and discrete state spaces, unifying their theoretical foundations and training methods.

Contribution

It develops a unified framework for diffusion processes in general state spaces, connecting continuous SDEs and discrete Markov chains with explicit derivations and training principles.

Findings

01

Unified diffusion framework for continuous and discrete spaces

02

Explicit derivation of Fokker-Planck and master equations

03

Insights into how forward corruption shapes reverse dynamics

Abstract

Although diffusion models now occupy a central place in generative modeling, introductory treatments commonly assume Euclidean data and seldom clarify their connection to discrete-state analogues. This article is a self-contained primer on diffusion over general state spaces, unifying continuous domains and discrete/categorical structures under one lens. We develop the discrete-time view (forward noising via Markov kernels and learned reverse dynamics) alongside its continuous-time limits -- stochastic differential equations (SDEs) in $R^{d}$ and continuous-time Markov chains (CTMCs) on finite alphabets -- and derive the associated Fokker--Planck and master equations. A common variational treatment yields the ELBO that underpins standard training losses. We make explicit how forward corruption choices -- Gaussian processes in continuous spaces and structured categorical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Markov Chains and Monte Carlo Methods · Embodied and Extended Cognition