Language GANs Falling Short
Massimo Caccia, Lucas Caccia, William Fedus, Hugo Larochelle, Joelle, Pineau, Laurent Charlin

TL;DR
This paper critically evaluates the effectiveness of GAN-based models for natural language generation, revealing that traditional maximum likelihood models often outperform GAN variants when considering quality and diversity, and highlighting flaws in current evaluation methods.
Contribution
The study challenges the prevailing belief that GANs are superior for NLG by demonstrating that MLE models outperform GAN variants across the entire quality-diversity spectrum and exposing flaws in standard evaluation metrics.
Findings
MLE models outperform GAN variants in quality and diversity.
Temperature tuning improves quality-diversity trade-offs more effectively than adversarial training.
Flaws in quality-only evaluation metrics can be exploited to artificially inflate performance.
Abstract
Generating high-quality text with sufficient diversity is essential for a wide range of Natural Language Generation (NLG) tasks. Maximum-Likelihood (MLE) models trained with teacher forcing have consistently been reported as weak baselines, where poor performance is attributed to exposure bias (Bengio et al., 2015; Ranzato et al., 2015); at inference time, the model is fed its own prediction instead of a ground-truth token, which can lead to accumulating errors and poor samples. This line of reasoning has led to an outbreak of adversarial based approaches for NLG, on the account that GANs do not suffer from exposure bias. In this work, we make several surprising observations which contradict common beliefs. First, we revisit the canonical evaluation framework for NLG, and point out fundamental flaws with quality-only evaluation: we show that one can outperform such metrics using a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsConvolution · Dogecoin Customer Service Number +1-833-534-1729
