Lightweight Decoding Strategies for Increasing Specificity

Katy Ilonka Gero; Chris Kedzie; Savvas Petridis; Lydia Chilton

arXiv:2110.11850 · cs.CL · October 25, 2021

TL;DR

This paper introduces two unsupervised decoding strategies that enhance the specificity of language model outputs, making responses more detailed and less generic, with minimal loss of sensibility.

Contribution

The paper proposes two decoding methods, one based on word frequency and one based on point-wise mutual information, that increase the specificity of language model outputs without any additional training.

Findings

01

Both strategies increase output specificity in prompt completion tasks.

02

Strategies cause only modest decreases in sensibility.

03

The strategies also apply to summarization, where they can produce more specific summaries.

Abstract

Language models are known to produce vague and generic outputs. We propose two unsupervised decoding strategies based on either word-frequency or point-wise mutual information to increase the specificity of any model that outputs a probability distribution over its vocabulary at generation time. We test the strategies in a prompt completion task; with human evaluations, we find that both strategies increase the specificity of outputs with only modest decreases in sensibility. We also briefly present a summarization use case, where these strategies can produce more specific summaries.
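To make the idea concrete, here is a minimal sketch of how a next-word distribution could be rescored at decoding time. It is an illustration under stated assumptions, not the paper's exact formulation: the function names (rescore, corpus_prior), the weight lam, the floor value, and the toy numbers are all hypothetical. The frequency-based variant divides each conditional probability by a corpus unigram probability raised to lam; the PMI-style variant uses the same rule but takes the prior from the model's own distribution without the prompt, so the score approximates log p(w | prompt) - log p(w).

```python
import math


def rescore(cond_probs, prior_probs, lam=0.5, floor=1e-9):
    """Return a distribution proportional to p(w | context) / prior(w) ** lam.

    Hypothetical helper: lam = 0 leaves the model distribution unchanged,
    larger lam penalizes words that are likely a priori. Words missing from
    the prior fall back to `floor`, so a real system would need smoothing.
    """
    scores = {
        w: math.log(max(p, floor)) - lam * math.log(max(prior_probs.get(w, 0.0), floor))
        for w, p in cond_probs.items()
    }
    top = max(scores.values())  # subtract the max for numerical stability
    exps = {w: math.exp(s - top) for w, s in scores.items()}
    total = sum(exps.values())
    return {w: e / total for w, e in exps.items()}


def corpus_prior(counts):
    """Turn raw corpus counts into unigram probabilities."""
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()}


# Toy next-word distribution for an imagined prompt (made-up numbers).
cond = {"food": 0.50, "pasta": 0.30, "risotto": 0.20}

# Frequency-based variant: prior comes from (made-up) corpus counts.
counts = {"food": 900, "pasta": 90, "risotto": 10}
freq_based = rescore(cond, corpus_prior(counts), lam=0.5)

# PMI-style variant: prior is the model's distribution without the prompt.
no_prompt = {"food": 0.70, "pasta": 0.20, "risotto": 0.10}
pmi_based = rescore(cond, no_prompt, lam=1.0)

print(max(cond, key=cond.get))              # most likely word before rescoring
print(max(freq_based, key=freq_based.get))  # most likely word after frequency rescoring
print(max(pmi_based, key=pmi_based.get))    # most likely word after PMI-style rescoring
```

Because the rule only needs the model's probability distribution over its vocabulary at each step, it can sit on top of greedy search, beam search, or sampling without retraining. The weight lam controls the trade-off the abstract describes: a higher value favors rarer, more specific words but increases the risk of less sensible output.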

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Speech and dialogue systems

Methods: Test