A Survey on Diffusion Language Models
Tianyi Li, Mingda Chen, Bowei Guo, Zhiqiang Shen

TL;DR
Diffusion Language Models are an emerging class of models that generate text in parallel, offering advantages in speed and bidirectional context understanding, with recent advancements making them competitive with traditional autoregressive models.
Contribution
This survey provides a comprehensive overview, taxonomy, and analysis of DLMs, including their principles, techniques, inference strategies, and multimodal extensions, highlighting current challenges and future directions.
Findings
DLMs achieve several-fold speed-up over autoregressive models.
Recent DLMs show performance comparable to autoregressive counterparts.
Advancements include improved decoding, caching, and multimodal capabilities.
Abstract
Diffusion Language Models (DLMs) are rapidly emerging as a powerful and promising alternative to the dominant autoregressive (AR) paradigm. By generating tokens in parallel through an iterative denoising process, DLMs possess inherent advantages in reducing inference latency and capturing bidirectional context, thereby enabling fine-grained control over the generation process. While achieving a several-fold speed-up, recent advancements have allowed DLMs to show performance comparable to their autoregressive counterparts, making them a compelling choice for various natural language processing tasks. In this survey, we provide a holistic overview of the current DLM landscape. We trace its evolution and relationship with other paradigms, such as autoregressive and masked language models, and cover both foundational principles and state-of-the-art models. Our work offers an up-to-date,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
