dLLM: Simple Diffusion Language Modeling
Zhanhui Zhou, Lingjie Chen, Hanghang Tong, Dawn Song

TL;DR
dLLM is an open-source framework that standardizes and simplifies the training, inference, and evaluation of diffusion language models, facilitating reproducibility, customization, and extension in the field.
Contribution
The paper introduces a unified, flexible framework for diffusion language models that supports reproducing, fine-tuning, and deploying existing models, as well as training small models from scratch.
Findings
Standardized pipeline for diffusion language models
Reproducible recipes for small DLMs from scratch
Released checkpoints to accelerate research
Abstract
Although diffusion language models (DLMs) are evolving quickly, many recent models converge on a set of shared components. These components, however, are distributed across ad-hoc research codebases or lack transparent implementations, making them difficult to reproduce or extend. As the field accelerates, there is a clear need for a unified framework that standardizes these common components while remaining flexible enough to support new methods and architectures. To address this gap, we introduce dLLM, an open-source framework that unifies the core components of diffusion language modeling -- training, inference, and evaluation -- and makes them easy to customize for new designs. With dLLM, users can reproduce, finetune, deploy, and evaluate open-source large DLMs such as LLaDA and Dream through a standardized pipeline. The framework also provides minimal, reproducible recipes for…
Code & Models
- 🤗 dllm-hub/ModernBERT-large-chat-v0.1 · model · 821 dl · ♡ 17
- 🤗 dllm-hub/ModernBERT-base-chat-v0.1 · model · 585 dl · ♡ 7
- 🤗 dllm-hub/Qwen3-0.6B-diffusion-mdlm-v0.1 · model · 7.8k dl · ♡ 11
- 🤗 dllm-hub/Qwen3-0.6B-diffusion-bd3lm-v0.1 · model · 3.6k dl · ♡ 10
- 🤗 dllm-hub/Qwen2.5-Coder-0.5B-Instruct-diffusion-bd3lm-v0.1 · model · 60 dl · ♡ 1
- 🤗 dllm-hub/Qwen2.5-Coder-0.5B-Instruct-diffusion-mdlm-v0.1 · model · 273 dl · ♡ 5
- 🤗 ruwwww/Qwen3-0.6B-diffusion-mdlm-v0.1 · model · 4 dl
- 🤗 Rosedale/Qwen3-0.6B-diffusion-bd3lm-v0.1 · model · 13 dl
- 🤗 OnAnOrange/Dream-7B-Instruct-s1k-sft · model · 12 dl
- 🤗 OnAnOrange/LLaDA-8B-Instruct-s1k-sft · model · 33 dl
Taxonomy
Topics: Topic Modeling · Language and cultural evolution · Computational and Text Analysis Methods
