Efficient Sampling with Discrete Diffusion Models: Sharp and Adaptive Guarantees
Daniil Dmitriev, Zhihan Huang, Yuting Wei

TL;DR
This paper provides theoretical guarantees for the efficiency of discrete diffusion models' sampling algorithms, showing they can achieve accuracy with fewer iterations and adapt to data structure, improving understanding of their convergence behavior.
Contribution
It establishes sharp convergence bounds for discrete diffusion sampling, introduces a structure-adaptive sampler, and proves lower bounds on iteration complexity, advancing theoretical understanding of these models.
Findings
Uniform diffusion sampler achieves / complexity, removing dependence on vocabulary size.
Lower bounds show linear dependence on ambient dimension is unavoidable.
Adaptive sampler exploits data structure, achieving sublinear convergence in practical scenarios.
Abstract
Diffusion models over discrete spaces have recently shown striking empirical success, yet their theoretical foundations remain incomplete. In this paper, we study the sampling efficiency of score-based discrete diffusion models under a continuous-time Markov chain (CTMC) formulation, with a focus on -leaping-based samplers. We establish sharp convergence guarantees for attaining accuracy in Kullback-Leibler (KL) divergence for both uniform and masking noising processes. For uniform discrete diffusion, we show that the -leaping algorithm achieves an iteration complexity of order , with the ambient dimension of the target distribution, eliminating linear dependence on the vocabulary size and improving existing bounds by a factor of ; moreover, we establish a matching algorithmic lower bound showing that linear dependence on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Bayesian Methods and Mixture Models · MRI in cancer diagnosis
