Diffusion-LM Improves Controllable Text Generation

Xiang Lisa Li; John Thickstun; Ishaan Gulrajani; Percy Liang,; Tatsunori B. Hashimoto

arXiv:2205.14217·cs.CL·May 31, 2022·237 cites

Diffusion-LM Improves Controllable Text Generation

Xiang Lisa Li, John Thickstun, Ishaan Gulrajani, Percy Liang,, Tatsunori B. Hashimoto

PDF

Open Access 1 Repo 1 Models 1 Video

TL;DR

This paper introduces Diffusion-LM, a non-autoregressive diffusion-based language model that enables fine-grained controllable text generation without re-training, outperforming previous methods on complex control tasks.

Contribution

The paper presents a novel diffusion-based language model that allows for complex, controllable text generation through a simple gradient-based method, addressing limitations of prior approaches.

Findings

01

Successfully controls six fine-grained text attributes

02

Outperforms prior methods significantly on control tasks

03

Demonstrates the effectiveness of diffusion models in NLP

Abstract

Controlling the behavior of language models (LMs) without re-training is a major open problem in natural language generation. While recent works have demonstrated successes on controlling simple sentence attributes (e.g., sentiment), there has been little progress on complex, fine-grained controls (e.g., syntactic structure). To address this challenge, we develop a new non-autoregressive language model based on continuous diffusions that we call Diffusion-LM. Building upon the recent successes of diffusion models in continuous domains, Diffusion-LM iteratively denoises a sequence of Gaussian vectors into word vectors, yielding a sequence of intermediate latent variables. The continuous, hierarchical nature of these intermediate variables enables a simple gradient-based algorithm to perform complex, controllable generation tasks. We demonstrate successful control of Diffusion-LM for six…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xiangli1999/diffusion-lm
jaxOfficial

Models

🤗
yasserrmd/diffusion-text-demo
model· 20 dl
20 dl

Videos

Diffusion-LM Improves Controllable Text Generation· slideslive

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis

MethodsDiffusion