TextGAIL: Generative Adversarial Imitation Learning for Text Generation

Qingyang Wu; Lei Li; Zhou Yu

arXiv:2004.13796·cs.CL·April 28, 2021·5 cites

TextGAIL: Generative Adversarial Imitation Learning for Text Generation

Qingyang Wu, Lei Li, Zhou Yu

PDF

Open Access 1 Models 1 Video

TL;DR

TextGAIL introduces a novel adversarial imitation learning framework utilizing pre-trained language models and contrastive discriminators to enhance the quality and diversity of generated text, outperforming traditional GAN approaches.

Contribution

The paper proposes a new text generation method combining imitation learning, contrastive discriminators, and PPO, addressing GAN limitations in text generation.

Findings

01

TextGAIL outperforms MLE baselines in quality and diversity.

02

Discriminator effectively provides reliable reward signals.

03

Method is validated on various unconditional and conditional tasks.

Abstract

Generative Adversarial Networks (GANs) for text generation have recently received many criticisms, as they perform worse than their MLE counterparts. We suspect previous text GANs' inferior performance is due to the lack of a reliable guiding signal in their discriminators. To address this problem, we propose a generative adversarial imitation learning framework for text generation that uses large pre-trained language models to provide more reliable reward guidance. Our approach uses contrastive discriminator, and proximal policy optimization (PPO) to stabilize and improve text generation performance. For evaluation, we conduct experiments on a diverse set of unconditional and conditional text generation tasks. Experimental results show that TextGAIL achieves better performance in terms of both quality and diversity than the MLE baseline. We also validate our intuition that TextGAIL's…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
npc-engine/exported-bart-light-gail-chatbot
model· ♡ 2
♡ 2

Videos

TextGAIL: Generative Adversarial Imitation Learning for Text Generation· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Computational and Text Analysis Methods

MethodsLinear Layer · Cosine Annealing · Discriminative Fine-Tuning · Linear Warmup With Cosine Annealing · Byte Pair Encoding · GPT-2 · Adam · Softmax · Layer Normalization · Dropout