Swordsman: Entropy-Driven Adaptive Block Partition for Efficient Diffusion Language Models

Yu Zhang; Xinchen Li; Jialei Zhou; Hongnan Ma; Zhongwei Wan; Yiwei Shi; Duoqian Miao; Qi Zhang; Longbing Cao

arXiv:2602.04399·cs.CL·February 5, 2026

Swordsman: Entropy-Driven Adaptive Block Partition for Efficient Diffusion Language Models

Yu Zhang, Xinchen Li, Jialei Zhou, Hongnan Ma, Zhongwei Wan, Yiwei Shi, Duoqian Miao, Qi Zhang, Longbing Cao

PDF

Open Access

TL;DR

Swordsman introduces an entropy-driven adaptive block partitioning method for diffusion language models, improving inference efficiency and quality by aligning decoding with semantic boundaries without additional training.

Contribution

It proposes a novel entropy-based adaptive block partitioning framework that dynamically aligns decoding with semantic boundaries in diffusion language models.

Findings

01

Achieves state-of-the-art performance in efficiency and quality

02

Effectively aligns decoding with semantic or syntactic boundaries

03

Operates without additional training, supported by KV Cache

Abstract

Block-wise decoding effectively improves the inference speed and quality in diffusion language models (DLMs) by combining inter-block sequential denoising and intra-block parallel unmasking. However, existing block-wise decoding methods typically partition blocks in a rigid and fixed manner, which inevitably fragments complete semantic or syntactic constituents, leading to suboptimal performance. Inspired by the entropy reduction hypothesis (ERH), we recognize that constituent boundaries offer greater opportunities for uncertainty reduction, which motivates us to employ entropy analysis for identifying constituent boundaries. Therefore, we propose Swordsman, an entropy-driven adaptive block-wise decoding framework for DLMs. Swordsman adaptively partitions blocks by identifying entropy shifts between adjacent tokens to better align with semantic or syntactic constituent boundaries. In…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Generative Adversarial Networks and Image Synthesis · Speech Recognition and Synthesis