Follow the Wisdom of the Crowd: Effective Text Generation via Minimum   Bayes Risk Decoding

Mirac Suzgun; Luke Melas-Kyriazi; Dan Jurafsky

arXiv:2211.07634·cs.CL·November 15, 2022·1 cites

Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding

Mirac Suzgun, Luke Melas-Kyriazi, Dan Jurafsky

PDF

Open Access 1 Repo

TL;DR

This paper introduces crowd sampling, a Bayesian risk minimization decoding method inspired by the wisdom of the crowd, which improves diversity and quality in open-ended text generation tasks.

Contribution

It proposes a novel decoding approach that generalizes existing methods and effectively balances diversity and quality in natural language generation.

Findings

01

Improves ROUGE and BLEU scores by 3-7 points across multiple tasks

02

Achieves new state-of-the-art results on WebNLG and WMT'16

03

Provides a drop-in replacement for existing sampling methods

Abstract

In open-ended natural-language generation, existing text decoding methods typically struggle to produce text which is both diverse and high-quality. Greedy and beam search are known to suffer from text degeneration and linguistic diversity issues, while temperature, top-k, and nucleus sampling often yield diverse but low-quality outputs. In this work, we present crowd sampling, a family of decoding methods based on Bayesian risk minimization, to address this diversity-quality trade-off. Inspired by the principle of "the wisdom of the crowd," crowd sampling seeks to select a candidate from a pool of candidates that has the least expected risk (i.e., highest expected reward) under a generative model according to a given utility function. Crowd sampling can be seen as a generalization of numerous existing methods, including majority voting, and in practice, it can be used as a drop-in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

suzgunmirac/crowd-sampling
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Hate Speech and Cyberbullying Detection