When to Commit? Towards Variable-Size Self-Contained Blocks for Discrete Diffusion Language Models
Danny Wang, Ruihong Qiu, Zi Huang

TL;DR
This paper introduces Variable-size Self-contained Blocks (VSB) for discrete diffusion language models, using a self-containedness criterion to improve block boundary decisions during decoding.
Contribution
It proposes a novel self-containedness criterion for block commitment and develops VSB, which adaptively selects block boundaries based on predictive divergence, improving decoding consistency.
Findings
VSB outperforms fixed-size and heuristic blockwise decoding in experiments.
Self-containedness correlates with predictive consistency and decoding quality.
Theoretical analysis links self-containedness to model prediction stability.
Abstract
Discrete diffusion language models (dLLMs) enable parallel token updates with bidirectional attention, yet practical generation typically adopts blockwise semi-autoregressive decoding. This switch creates a training-inference mismatch: training denoises with full-sequence context, while inference commits tokens within a bounded block without future context. Therefore, decoding with fixed-size or heuristic-based blocks can lead to premature token commitments, as decisions are made without full access to future context that could alter those choices. Motivated by this, we propose self-containedness as a principled criterion for block commitment. A block is self-contained if its predictions remain consistent with Future-Aware (FA) or without No-Future (NF) access to future context, reframing block boundary selection as a test of self-containedness rather than a heuristic choice. Based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
