Improving Variable-Length Generation in Diffusion Language Models via Length Regularization
Zicong Cheng, Ruixuan Jia, Jia Li, Guo-Wei Yang, Meng-Hao Guo, Shi-Min Hu

TL;DR
This paper introduces LR-DLLM, a length-regularized inference method for diffusion language models that improves variable-length generation by explicitly modeling and correcting length bias, leading to more reliable and flexible text generation.
Contribution
It proposes a novel length regularization framework that enables diffusion language models to accurately determine and adapt generation length without retraining.
Findings
Achieves 51.3% Pass@1 on HumanEvalInfilling with unknown lengths.
Outperforms DreamOn with a +13.4% improvement.
Demonstrates effectiveness across four languages with a 14.3% gain.
Abstract
Diffusion Large Language Models (DLLMs) are inherently ill-suited for variable-length generation, as their inference is defined on a fixed-length canvas and implicitly assumes a known target length. When the length is unknown, as in realistic completion and infilling, naively comparing confidence across mask lengths becomes systematically biased, leading to under-generation or redundant continuations. In this paper, we show that this failure arises from an intrinsic lengthinduced bias in generation confidence estimates, leaving existing DLLMs without a robust way to determine generation length and making variablelength inference unreliable. To address this issue, we propose LR-DLLM, a length-regularized inference framework for DLLMs that treats generation length as an explicit variable and achieves reliable length determination at inference time. It decouples semantic compatibility from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
