Dimension-Free Convergence of Discrete Diffusion Models: Adjoint Equations Induce the Right Space
Kelvin Kan, Xingjian Li, Benjamin J. Zhang, Tuhin Sahai, Stanley Osher, Markos A. Katsoulakis

TL;DR
This paper introduces a new theoretical framework for discrete diffusion models that guarantees convergence independent of the state space size, applicable to various priors and metrics.
Contribution
It develops a unified adjoint-equation-based approach that achieves dimension-free convergence guarantees in any IPM, overcoming limitations of prior KL and TV analyses.
Findings
First dimension-free convergence guarantees for discrete diffusion models
Applicable to masked and uniform priors with minimal assumptions
Introduces novel techniques like adjoint equations and score-marginal cancellation.
Abstract
Discrete diffusion has become a leading framework for generative modeling in various applications including language, vision, and biology. Existing convergence theory, however, exhibits fundamental limitations. KL-based analyses diverge under singular priors such as the masked distribution, while bounds in total variation (TV) depend on the state space size and become vacuous for modern language tasks, where vocabularies contain hundreds of thousands of tokens. We develop a unified adjoint-equation-based framework that establishes dimension-free convergence guarantees in any integral probability metric (IPM). To the best of our knowledge, our bounds are the first to be entirely free of and applicable to both masked and uniform priors. Importantly, our theory relies only on a single standard rate-matrix regularity assumption and is compatible with time-inhomogeneous schedules.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
