Language GANs Falling Short

Massimo Caccia; Lucas Caccia; William Fedus; Hugo Larochelle; Joelle; Pineau; Laurent Charlin

arXiv:1811.02549·cs.CL·February 21, 2020·82 cites

Language GANs Falling Short

Massimo Caccia, Lucas Caccia, William Fedus, Hugo Larochelle, Joelle, Pineau, Laurent Charlin

PDF

Open Access 1 Repo

TL;DR

This paper critically evaluates the effectiveness of GAN-based models for natural language generation, revealing that traditional maximum likelihood models often outperform GAN variants when considering quality and diversity, and highlighting flaws in current evaluation methods.

Contribution

The study challenges the prevailing belief that GANs are superior for NLG by demonstrating that MLE models outperform GAN variants across the entire quality-diversity spectrum and exposing flaws in standard evaluation metrics.

Findings

01

MLE models outperform GAN variants in quality and diversity.

02

Temperature tuning improves quality-diversity trade-offs more effectively than adversarial training.

03

Flaws in quality-only evaluation metrics can be exploited to artificially inflate performance.

Abstract

Generating high-quality text with sufficient diversity is essential for a wide range of Natural Language Generation (NLG) tasks. Maximum-Likelihood (MLE) models trained with teacher forcing have consistently been reported as weak baselines, where poor performance is attributed to exposure bias (Bengio et al., 2015; Ranzato et al., 2015); at inference time, the model is fed its own prediction instead of a ground-truth token, which can lead to accumulating errors and poor samples. This line of reasoning has led to an outbreak of adversarial based approaches for NLG, on the account that GANs do not suffer from exposure bias. In this work, we make several surprising observations which contradict common beliefs. First, we revisit the canonical evaluation framework for NLG, and point out fundamental flaws with quality-only evaluation: we show that one can outperform such metrics using a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pclucas14/GansFallingShort
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsConvolution · Dogecoin Customer Service Number +1-833-534-1729