TL;DR
This paper introduces a unified framework called sampling adapters that modifies language model sampling distributions, improving text quality by balancing precision and recall, and correlating with higher sequence-level quality scores.
Contribution
It formalizes sampling adapters as a trade-off between precision and recall, explaining their effectiveness in enhancing text quality beyond standard metrics.
Findings
Sampling adapters increase the alignment of generated text with true distributions.
They improve sequence-level quality scores such as Mauve.
Local distribution changes lead to more coherent and grammatical text.
Abstract
Sampling is a common strategy for generating text from probabilistic models, yet standard ancestral sampling often results in text that is incoherent or ungrammatical. To alleviate this issue, various modifications to a model's sampling distribution, such as nucleus or top-k sampling, have been introduced and are now ubiquitously used in language generation systems. We propose a unified framework for understanding these techniques, which we term sampling adapters. Sampling adapters often lead to qualitatively better text, which raises the question: From a formal perspective, how are they changing the (sub)word-level distributions of language generation models? And why do these local changes lead to higher-quality text? We argue that the shift they enforce can be viewed as a trade-off between precision and recall: while the model loses its ability to produce certain strings, its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
