A Frustratingly Simple Decoding Method for Neural Text Generation

Haoran Yang; Deng Cai; Huayang Li; Wei Bi; Wai Lam; Shuming Shi

arXiv:2305.12675·cs.CL·February 28, 2024·2 cites

A Frustratingly Simple Decoding Method for Neural Text Generation

Haoran Yang, Deng Cai, Huayang Li, Wei Bi, Wai Lam, Shuming Shi

PDF

Open Access 1 Repo

TL;DR

This paper presents Frustratingly Simple Decoding (FSD), a highly efficient and effective method for neural text generation that penalizes repeated text using an anti-language model, outperforming existing decoding strategies.

Contribution

The paper introduces FSD, a novel decoding method that requires no extra parameters and is computationally efficient, significantly improving text generation quality.

Findings

01

FSD outperforms nucleus sampling and strong baselines.

02

FSD is as fast as greedy search.

03

FSD effectively reduces repetitive text in generated outputs.

Abstract

We introduce a frustratingly simple, super efficient and surprisingly effective decoding method, which we call Frustratingly Simple Decoding (FSD), for neural text generation. The idea behind FSD is straightforward: we build an anti-LM based on previously generated text and use this anti-LM to penalize future generation of what has been generated. The anti-LM can be implemented as simple as an n-gram language model or a vectorized variant. In this way, FSD introduces no extra model parameters and negligible computational overhead (FSD can be as fast as greedy search). Despite the simplicity, FSD is surprisingly effective; Experiments show that FSD can outperform the canonical methods to date (i.e., nucleus sampling) as well as several strong baselines that were proposed recently.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lhryang/fsd
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification