VDLM: Variable Diffusion LMs via Robust Latent-to-Text Rendering

Shuhui Qu

arXiv:2602.15870·cs.CL·February 19, 2026

VDLM: Variable Diffusion LMs via Robust Latent-to-Text Rendering

Shuhui Qu

PDF

Open Access

TL;DR

VDLM introduces a modular diffusion-based language model that separates semantic planning from text rendering, enabling iterative refinement and improved long-form generation across diverse benchmarks.

Contribution

The paper presents VDLM, a novel variable diffusion language model that decouples planning and rendering, with embedding-space post-training and robust latent-to-text conversion.

Findings

01

Outperforms baselines on long-form generation tasks.

02

Effective in reasoning, math, and code benchmarks.

03

Robust to planner noise through embedding perturbations.

Abstract

Autoregressive language models decode left-to-right with irreversible commitments, limiting revision during multi-step reasoning. We propose \textbf{VDLM}, a modular variable diffusion language model that separates semantic planning from text rendering. VDLM applies LLaDA-style masked diffusion over semantic variable embeddings to enable iterative refinement in latent space, then post-trains the planner with trajectory-aware optimization using embedding-space rewards and values, avoiding text decoding inside the RL loop. To convert planned embeddings back to text, we use a \textbf{Vec2Text} renderer and introduce \textbf{embedding perturbations} to robustify decoding under planner noise. Across nine benchmarks spanning general reasoning, math, and code, VDLM is competitive in pre-training and yields substantial post-training improvements on long-form generation tasks, outperforming…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Machine Learning in Healthcare · Topic Modeling