When to Commit? Towards Variable-Size Self-Contained Blocks for Discrete Diffusion Language Models

Danny Wang; Ruihong Qiu; Zi Huang

arXiv:2604.23994·cs.LG·April 28, 2026

When to Commit? Towards Variable-Size Self-Contained Blocks for Discrete Diffusion Language Models

Danny Wang, Ruihong Qiu, Zi Huang

PDF

TL;DR

This paper introduces Variable-size Self-contained Blocks (VSB) for discrete diffusion language models, using a self-containedness criterion to improve block boundary decisions during decoding.

Contribution

It proposes a novel self-containedness criterion for block commitment and develops VSB, which adaptively selects block boundaries based on predictive divergence, improving decoding consistency.

Findings

01

VSB outperforms fixed-size and heuristic blockwise decoding in experiments.

02

Self-containedness correlates with predictive consistency and decoding quality.

03

Theoretical analysis links self-containedness to model prediction stability.

Abstract

Discrete diffusion language models (dLLMs) enable parallel token updates with bidirectional attention, yet practical generation typically adopts blockwise semi-autoregressive decoding. This switch creates a training-inference mismatch: training denoises with full-sequence context, while inference commits tokens within a bounded block without future context. Therefore, decoding with fixed-size or heuristic-based blocks can lead to premature token commitments, as decisions are made without full access to future context that could alter those choices. Motivated by this, we propose self-containedness as a principled criterion for block commitment. A block is self-contained if its predictions remain consistent with Future-Aware (FA) or without No-Future (NF) access to future context, reframing block boundary selection as a test of self-containedness rather than a heuristic choice. Based on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.