Segment-Level Diffusion: A Framework for Controllable Long-Form Generation with Diffusion Language Models

Xiaochen Zhu; Georgi Karadzhov; Chenxi Whitehouse; Andreas Vlachos

arXiv:2412.11333·cs.CL·May 27, 2025

Segment-Level Diffusion: A Framework for Controllable Long-Form Generation with Diffusion Language Models

Xiaochen Zhu, Georgi Karadzhov, Chenxi Whitehouse, Andreas Vlachos

PDF

Open Access 1 Video

TL;DR

This paper introduces Segment-Level Diffusion (SLD), a novel framework that improves long-form text generation by combining segmentation, robust representation learning, and latent-space guidance, outperforming existing diffusion models.

Contribution

SLD is the first to segment long text into latent representations for diffusion, enhancing scalability and coherence in long-form generation.

Findings

01

SLD achieves superior fluency and coherence in experiments.

02

SLD outperforms baseline diffusion and autoregressive models.

03

SLD demonstrates scalability for long-form text generation.

Abstract

Diffusion models have shown promise in text generation, but often struggle with generating long, coherent, and contextually accurate text. Token-level diffusion doesn't model word-order dependencies explicitly and operates on short, fixed output windows, while passage-level diffusion struggles with learning robust representations for long-form text. To address these challenges, we propose Segment-Level Diffusion (SLD), a framework that enhances diffusion-based text generation through text segmentation, robust representation training with adversarial and contrastive learning, and improved latent-space guidance. By segmenting long-form outputs into multiple latent representations and decoding them with an autoregressive decoder, SLD simplifies diffusion predictions and improves scalability. Experiments on four datasets demonstrate that, when compared to other diffusion and autoregressive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Segment-Level Diffusion: A Framework for Controllable Long-Form Generation with Diffusion Language Models· underline

Taxonomy

TopicsModel-Driven Software Engineering Techniques

MethodsDiffusion