A Distributional Approach to Controlled Text Generation

Muhammad Khalifa; Hady Elsahar; Marc Dymetman

arXiv:2012.11635·cs.CL·May 7, 2021·31 cites

A Distributional Approach to Controlled Text Generation

Muhammad Khalifa, Hady Elsahar, Marc Dymetman

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper introduces a novel distributional framework for controlled text generation using pre-trained language models, enabling flexible pointwise and distributional constraints while minimizing divergence from the original model.

Contribution

It presents the first model capable of handling both pointwise and distributional constraints within a unified formal framework, utilizing an explicit energy-based model and adaptive policy gradient training.

Findings

01

Outperforms baselines in balancing constraint satisfaction and divergence

02

Demonstrates potential to mitigate bias in language models

03

Shows faster convergence with adaptive techniques

Abstract

We propose a Distributional Approach for addressing Controlled Text Generation from pre-trained Language Models (LMs). This approach permits to specify, in a single formal framework, both "pointwise" and "distributional" constraints over the target LM -- to our knowledge, the first model with such generality -- while minimizing KL divergence from the initial LM distribution. The optimal target distribution is then uniquely determined as an explicit EBM (Energy-Based Model) representation. From that optimal representation we then train a target controlled Autoregressive LM through an adaptive distributional variant of Policy Gradient. We conduct a first set of experiments over pointwise constraints showing the advantages of our approach over a set of baselines, in terms of obtaining a controlled LM balancing constraint satisfaction with divergence from the initial LM. We then perform…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

A Distributional Approach to Controlled Text Generation· slideslive

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Artificial Intelligence in Games

Methodsenergy-based model