Sharp Convergence Rates for Masked Diffusion Models
Yuchen Liang, Zhiheng Tan, Ness Shroff, Yingbin Liang

TL;DR
This paper provides new theoretical convergence guarantees for masked diffusion models, especially the Euler and FHS samplers, using total variation analysis to improve understanding and establish tight bounds.
Contribution
It introduces a TV-based analysis for the Euler method that relaxes assumptions and provides tight bounds, and offers the first lower bound for Euler convergence and an analysis of FHS error.
Findings
Euler method convergence bounds are improved and assumptions relaxed.
FHS incurs no additional sampling error beyond score estimation.
Established tight lower bounds for both Euler and FHS samplers.
Abstract
Discrete diffusion models have achieved strong empirical performance in text and other symbolic domains, with masked (absorbing-rate) variants emerging as competitive alternatives to autoregressive models. Among existing samplers, the Euler method remains the standard choice in many applications, and more recently, the First-Hitting Sampler (FHS) has shown considerable promise for masked diffusion models. Despite their practical success, the theoretical understanding of these samplers remains limited. Existing analyses are conducted in Kullback-Leibler (KL) divergence, which often yields loose parameter dependencies and requires strong assumptions on score estimation. Moreover, these guarantees do not cover recently developed high-performance sampler of FHS. In this work, we first develop a direct total-variation (TV) based analysis for the Euler method that overcomes these limitations.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Markov Chains and Monte Carlo Methods · Advanced Neuroimaging Techniques and Applications
