FUDGE: Controlled Text Generation With Future Discriminators

Kevin Yang; Dan Klein

arXiv:2104.05218·cs.CL·August 17, 2021

FUDGE: Controlled Text Generation With Future Discriminators

Kevin Yang, Dan Klein

PDF

3 Repos

TL;DR

FUDGE is a modular method for controlled text generation that adjusts a base model’s output probabilities using learned attribute predictors, enabling flexible conditioning on multiple attributes across various tasks.

Contribution

FUDGE introduces a novel approach that uses future discriminators to condition text generation on desired attributes with minimal access to the base model's internals.

Findings

01

Improves control over text attributes in multiple tasks

02

Easily combines multiple attribute predictors

03

Achieves measurable gains in poetry, topic control, and translation

Abstract

We propose Future Discriminators for Generation (FUDGE), a flexible and modular method for controlled text generation. Given a pre-existing model G for generating text from a distribution of interest, FUDGE enables conditioning on a desired attribute a (for example, formality) while requiring access only to G's output logits. FUDGE learns an attribute predictor operating on a partial sequence, and uses this predictor's outputs to adjust G's original probabilities. We show that FUDGE models terms corresponding to a Bayesian decomposition of the conditional distribution of G given attribute a. Moreover, FUDGE can easily compose predictors for multiple desired attributes. We evaluate FUDGE on three tasks -- couplet completion in poetry, topic control in language generation, and formality change in machine translation -- and observe gains in all three tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.