Diffusion Guided Language Modeling
Justin Lovelace, Varsha Kishore, Yiwei Chen, Kilian Q. Weinberger

TL;DR
This paper introduces a guided diffusion approach that combines the fluency of auto-regressive models with the flexibility of diffusion for attribute-controlled text generation, outperforming previous methods.
Contribution
It presents a novel framework that uses diffusion to steer auto-regressive language models, enabling flexible attribute control with minimal additional training.
Findings
Outperforms previous plug-and-play guidance methods on benchmark datasets.
Controls new attributes with a single logistic regression classifier.
Maintains high fluency while guiding attribute-specific text generation.
Abstract
Current language models demonstrate remarkable proficiency in text generation. However, for many applications it is desirable to control attributes, such as sentiment, or toxicity, of the generated language -- ideally tailored towards each specific use case and target audience. For auto-regressive language models, existing guidance methods are prone to decoding errors that cascade during generation and degrade performance. In contrast, text diffusion models can easily be guided with, for example, a simple linear sentiment classifier -- however they do suffer from significantly higher perplexity than auto-regressive alternatives. In this paper we use a guided diffusion model to produce a latent proposal that steers an auto-regressive language model to generate text with desired properties. Our model inherits the unmatched fluency of the auto-regressive approach and the plug-and-play…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
MethodsDiffusion · Logistic Regression
