Convergence Analysis of Discrete Diffusion Model: Exact Implementation through Uniformization
Hongrui Chen, Lexing Ying

TL;DR
This paper provides a theoretical analysis of discrete diffusion models formulated as continuous-time Markov chains, introducing an exact implementation via uniformization and deriving convergence guarantees.
Contribution
It introduces a uniformization-based algorithm for discrete diffusion models and offers theoretical guarantees on sampling accuracy under certain assumptions.
Findings
Derives Total Variation and KL divergence bounds for discrete diffusion sampling.
Aligns theoretical results with state-of-the-art continuous diffusion models.
Highlights advantages of discrete over continuous diffusion models.
Abstract
Diffusion models have achieved huge empirical success in data generation tasks. Recently, some efforts have been made to adapt the framework of diffusion models to discrete state space, providing a more natural approach for modeling intrinsically discrete data, such as language and graphs. This is achieved by formulating both the forward noising process and the corresponding reversed process as Continuous Time Markov Chains (CTMCs). In this paper, we investigate the theoretical properties of the discrete diffusion model. Specifically, we introduce an algorithm leveraging the uniformization of continuous Markov chains, implementing transitions on random time points. Under reasonable assumptions on the learning of the discrete score function, we derive Total Variation distance and KL divergence guarantees for sampling from any distribution on a hypercube. Our results align with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDifferential Equations and Numerical Methods
MethodsDiffusion · ALIGN
