Prompt Waywardness: The Curious Case of Discretized Interpretation of   Continuous Prompts

Daniel Khashabi; Shane Lyu; Sewon Min; Lianhui Qin; Kyle Richardson,; Sean Welleck; Hannaneh Hajishirzi; Tushar Khot; Ashish Sabharwal; Sameer; Singh; Yejin Choi

arXiv:2112.08348·cs.CL·May 5, 2022·1 cites

Prompt Waywardness: The Curious Case of Discretized Interpretation of Continuous Prompts

Daniel Khashabi, Shane Lyu, Sewon Min, Lianhui Qin, Kyle Richardson,, Sean Welleck, Hannaneh Hajishirzi, Tushar Khot, Ashish Sabharwal, Sameer, Singh, Yejin Choi

PDF

Open Access 1 Repo

TL;DR

This paper explores the challenge of interpreting continuous prompts in language models by examining their discretized counterparts, revealing surprising 'wayward' behaviors and implications for faithful interpretation and generalization.

Contribution

It uncovers the phenomenon of waywardness in discretized prompts, providing empirical analysis and insights into the behavior of prompts across model sizes and tasks.

Findings

01

Larger models exhibit higher waywardness, mapping prompts more closely to arbitrary texts.

02

Continuous prompts can solve tasks while being projected to contradictory or unrelated texts.

03

Faithful interpretation of continuous prompts is fundamentally challenging due to observed behaviors.

Abstract

Fine-tuning continuous prompts for target tasks has recently emerged as a compact alternative to full model fine-tuning. Motivated by these promising results, we investigate the feasibility of extracting a discrete (textual) interpretation of continuous prompts that is faithful to the problem they solve. In practice, we observe a "wayward" behavior between the task solved by continuous prompts and their nearest neighbor discrete projections: We can find continuous prompts that solve a task while being projected to an arbitrary text (e.g., definition of a different or even a contradictory task), while being within a very small (2%) margin of the best continuous prompt of the same size for the task. We provide intuitions behind this odd and surprising behavior, as well as extensive empirical analyses quantifying the effect of various parameters. For instance, for larger model sizes we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

alrope123/prompt-waywardness
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems