On the Efficacy of Sampling Adapters

Clara Meister; Tiago Pimentel; Luca Malagutti; Ethan G. Wilcox; Ryan; Cotterell

arXiv:2307.03749·cs.CL·January 8, 2024

On the Efficacy of Sampling Adapters

Clara Meister, Tiago Pimentel, Luca Malagutti, Ethan G. Wilcox, Ryan, Cotterell

PDF

1 Repo

TL;DR

This paper introduces a unified framework called sampling adapters that modifies language model sampling distributions, improving text quality by balancing precision and recall, and correlating with higher sequence-level quality scores.

Contribution

It formalizes sampling adapters as a trade-off between precision and recall, explaining their effectiveness in enhancing text quality beyond standard metrics.

Findings

01

Sampling adapters increase the alignment of generated text with true distributions.

02

They improve sequence-level quality scores such as Mauve.

03

Local distribution changes lead to more coherent and grammatical text.

Abstract

Sampling is a common strategy for generating text from probabilistic models, yet standard ancestral sampling often results in text that is incoherent or ungrammatical. To alleviate this issue, various modifications to a model's sampling distribution, such as nucleus or top-k sampling, have been introduced and are now ubiquitously used in language generation systems. We propose a unified framework for understanding these techniques, which we term sampling adapters. Sampling adapters often lead to qualitatively better text, which raises the question: From a formal perspective, how are they changing the (sub)word-level distributions of language generation models? And why do these local changes lead to higher-quality text? We argue that the shift they enforce can be viewed as a trade-off between precision and recall: while the model loses its ability to produce certain strings, its…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rycolab/sampling-adapters
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.