Implicit Unlikelihood Training: Improving Neural Text Generation with Reinforcement Learning

Evgeny Lagutin; Daniil Gavrilov; Pavel Kalaidin

arXiv:2101.04229·cs.CL·January 13, 2021

TL;DR

This paper fine-tunes language models with policy gradient reinforcement learning and, when combined with unlikelihood training, further reduces repetition in generated text without impacting language model quality.

Contribution

It proposes a novel method that integrates policy gradient reinforcement learning with unlikelihood training for better neural text generation.
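
For readers unfamiliar with the unlikelihood half of this combination, the following is a minimal sketch of a token-level unlikelihood loss for a single decoding step, in the spirit of Welleck et al. (2020). It is not taken from the paper's repository. The function name, the dictionary representation of the model distribution, and the default `alpha` are assumptions made for illustration only.

```python
import math


def unlikelihood_step_loss(probs, target, negative_candidates, alpha=1.0):
    """Loss for one decoding step (illustrative, not the authors' code).

    probs: dict mapping token -> model probability at this step.
    target: the ground-truth next token.
    negative_candidates: tokens to push down, e.g. tokens already in the context.
    alpha: weight of the unlikelihood term (hypothetical default).
    """
    eps = 1e-12  # guards against log(0)
    # Standard likelihood term: raise the probability of the true token.
    likelihood_term = -math.log(probs[target] + eps)
    # Unlikelihood term: lower the probability of each negative candidate,
    # excluding the target itself and counting each candidate once.
    negatives = set(negative_candidates) - {target}
    unlikelihood_term = -sum(math.log(1.0 - probs[c] + eps) for c in negatives)
    return likelihood_term + alpha * unlikelihood_term


# Toy distribution over a three-token vocabulary.
probs = {"the": 0.5, "cat": 0.3, "sat": 0.2}
base = unlikelihood_step_loss(probs, "sat", negative_candidates=[])
penalized = unlikelihood_step_loss(probs, "sat", negative_candidates=["the", "cat"])
```

With no negative candidates the loss reduces to ordinary negative log-likelihood; adding previously seen tokens as candidates increases the loss whenever the model still assigns them probability mass.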

Findings

1. Reduces repetition in generated text
2. Maintains language model quality
3. Outperforms baseline methods in control metrics

Abstract

Likelihood training and maximization-based decoding result in dull and repetitive generated texts even when using powerful language models (Holtzman et al., 2019). Adding a loss function for regularization was shown to improve text generation output by helping avoid unwanted properties, such as contradiction or repetition (Li et al., 2020). In this work, we propose fine-tuning a language model by using policy gradient reinforcement learning, directly optimizing for better generation. We apply this approach to minimizing repetition in generated text, and show that, when combined with unlikelihood training (Welleck et al., 2020), our method further reduces repetition without impacting the language model quality. We also evaluate other methods for improving generation at training and decoding time, and compare them using various metrics aimed at control for better text generation output.
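
The policy gradient idea in the abstract can be illustrated with a small sketch. The abstract does not specify the reward or the estimator, so everything below is an assumption: the reward is the negative fraction of repeated n-grams in a sampled sequence, and the loss is a plain REINFORCE surrogate with a mean-reward baseline. The function names are hypothetical, and log-probabilities are plain floats here; in a real training loop they would be differentiable tensors produced by the model.

```python
def repeated_ngram_fraction(tokens, n=2):
    """Share of n-grams in `tokens` that duplicate an earlier n-gram."""
    ngrams = [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
    if not ngrams:
        return 0.0
    return 1.0 - len(set(ngrams)) / len(ngrams)


def reinforce_surrogate_loss(samples, n=2):
    """REINFORCE-style surrogate loss (illustrative, not the authors' code).

    samples: list of (tokens, token_log_probs) pairs sampled from the model.
    The reward is assumed to be the negative repetition rate, so that
    minimizing the loss raises the likelihood of less repetitive samples.
    """
    rewards = [-repeated_ngram_fraction(tokens, n) for tokens, _ in samples]
    baseline = sum(rewards) / len(rewards)  # simple variance-reduction baseline
    total = 0.0
    for (_, log_probs), reward in zip(samples, rewards):
        advantage = reward - baseline
        total += -advantage * sum(log_probs)
    return total / len(samples)


repetitive = "a b a b a b".split()
diverse = "a b c d e f".split()
samples = [
    (repetitive, [-0.5] * 6),  # hypothetical per-token log-probabilities
    (diverse, [-1.0] * 6),
]
loss = reinforce_surrogate_loss(samples)
```

In this toy batch the repetitive sample receives a below-baseline reward, so gradient descent on the surrogate would lower its log-probability and raise that of the diverse sample.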

Code & Models

Repositories

vklabmipt/implicit-unlikelihood-training (PyTorch, official)
