Dimension-Free Convergence of Discrete Diffusion Models: Adjoint Equations Induce the Right Space

Kelvin Kan; Xingjian Li; Benjamin J. Zhang; Tuhin Sahai; Stanley Osher; Markos A. Katsoulakis

arXiv:2605.17232·cs.LG·May 19, 2026

Dimension-Free Convergence of Discrete Diffusion Models: Adjoint Equations Induce the Right Space

Kelvin Kan, Xingjian Li, Benjamin J. Zhang, Tuhin Sahai, Stanley Osher, Markos A. Katsoulakis

PDF

TL;DR

This paper introduces a new theoretical framework for discrete diffusion models that guarantees convergence independent of the state space size, applicable to various priors and metrics.

Contribution

It develops a unified adjoint-equation-based approach that achieves dimension-free convergence guarantees in any IPM, overcoming limitations of prior KL and TV analyses.

Findings

01

First dimension-free convergence guarantees for discrete diffusion models

02

Applicable to masked and uniform priors with minimal assumptions

03

Introduces novel techniques like adjoint equations and score-marginal cancellation.

Abstract

Discrete diffusion has become a leading framework for generative modeling in various applications including language, vision, and biology. Existing convergence theory, however, exhibits fundamental limitations. KL-based analyses diverge under singular priors such as the masked distribution, while bounds in total variation (TV) depend on the state space size $S$ and become vacuous for modern language tasks, where vocabularies contain hundreds of thousands of tokens. We develop a unified adjoint-equation-based framework that establishes dimension-free convergence guarantees in any integral probability metric (IPM). To the best of our knowledge, our bounds are the first to be entirely free of $S$ and applicable to both masked and uniform priors. Importantly, our theory relies only on a single standard rate-matrix regularity assumption and is compatible with time-inhomogeneous schedules.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.