Are Prompt-based Models Clueless?

Pride Kavumba; Ryo Takahashi; Yusuke Oda

arXiv:2205.09295·cs.CL·May 23, 2022

Are Prompt-based Models Clueless?

Pride Kavumba, Ryo Takahashi, Yusuke Oda

PDF

Open Access

TL;DR

This paper investigates whether few-shot prompt-based language models exploit superficial dataset cues, finding that they do, especially on instances with superficial cues, raising questions about their true understanding capabilities.

Contribution

It provides an empirical analysis showing that prompt-based models also exploit superficial cues, challenging assumptions about their robustness and understanding.

Findings

01

Prompt-based models exploit superficial cues in datasets.

02

Models perform poorly on instances without superficial cues.

03

Prompting does not fully eliminate reliance on superficial cues.

Abstract

Finetuning large pre-trained language models with a task-specific head has advanced the state-of-the-art on many natural language understanding benchmarks. However, models with a task-specific head require a lot of training data, making them susceptible to learning and exploiting dataset-specific superficial cues that do not generalize to other datasets. Prompting has reduced the data requirement by reusing the language model head and formatting the task input to match the pre-training objective. Therefore, it is expected that few-shot prompt-based models do not exploit superficial cues. This paper presents an empirical examination of whether few-shot prompt-based models also exploit superficial cues. Analyzing few-shot prompt-based models on MNLI, SNLI, HANS, and COPA has revealed that prompt-based models also exploit superficial cues. While the models perform well on instances with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications