DSB: Dynamic Sliding Block Scheduling for Diffusion LLMs

Lizhuo Luo; Shenggui Li; Yonggang Wen; Tianwei Zhang

arXiv:2602.05992·cs.CL·March 17, 2026

DSB: Dynamic Sliding Block Scheduling for Diffusion LLMs

Lizhuo Luo, Shenggui Li, Yonggang Wen, Tianwei Zhang

PDF

Open Access

TL;DR

This paper introduces DSB, a dynamic sliding block scheduling method for diffusion LLMs that adapts to semantic difficulty, improving both output quality and inference efficiency without additional training.

Contribution

The paper proposes a novel, training-free dynamic sliding block scheduling approach and a KV-cache mechanism, addressing limitations of fixed schedules in diffusion LLMs.

Findings

01

Improves generation quality across multiple models and benchmarks.

02

Enhances inference efficiency without additional training.

03

Demonstrates robustness and generality of the proposed methods.

Abstract

Diffusion large language models (dLLMs) have emerged as a promising alternative for text generation, distinguished by their native support for parallel decoding. In practice, block inference is crucial for avoiding order misalignment in global bidirectional decoding and improving output quality. However, the widely-used fixed, predefined block (naive) schedule is agnostic to semantic difficulty, making it a suboptimal strategy for both quality and efficiency: it can force premature commitments to uncertain positions while delaying easy positions near block boundaries. In this work, we analyze the limitations of naive block scheduling and disclose the importance of dynamically adapting the schedule to semantic difficulty for reliable and efficient inference. Motivated by this, we propose Dynamic Sliding Block (DSB), a training-free block scheduling method that uses a sliding block with a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Generative Adversarial Networks and Image Synthesis