$\rho$-$\texttt{EOS}$: Training-free Bidirectional Variable-Length Control for Masked Diffusion LLMs
Jingyi Yang, Yuxian Jiang, Jing Shao

TL;DR
This paper introduces $ ho$-$ exttt{EOS}$, a training-free method that enables flexible, bidirectional variable-length generation in masked diffusion LLMs by estimating the implicit density of end-of-sequence tokens during denoising.
Contribution
It presents a novel, training-free approach for dynamic length control in masked diffusion LLMs using implicit $ exttt{EOS}$ density estimation within a single denoising process.
Findings
Achieves bidirectional length adjustment during generation.
Improves inference efficiency and token utilization.
Performs comparably to fixed-length models on benchmarks.
Abstract
Beyond parallel generation and global context modeling, current masked diffusion large language models (masked dLLMs, i.e., LLaDA) suffer from a fundamental limitation: they require a predefined, fixed generation length, which lacks flexibility and forces an inevitable trade-off between output quality and computational efficiency. To address this, we study the denoising dynamics and find that the implicit density () of end-of-sequence () tokens serves as a reliable signal of generation sufficiency. In particular, the evolving implicit density during denoising reveals whether the current masked space is excessive or insufficient, thereby guiding the adjustment direction for generation length. Building on this insight, we propose \textbf{\rho\texttt{EOS}}, a training-free, single-stage strategy that enables bidirectional variable-length generation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Topic Modeling · Block Copolymer Self-Assembly
