Avoiding Inference Heuristics in Few-shot Prompt-based Finetuning

Prasetya Ajie Utama; Nafise Sadat Moosavi; Victor Sanh; Iryna Gurevych

arXiv:2109.04144·cs.CL·September 10, 2021

Avoiding Inference Heuristics in Few-shot Prompt-based Finetuning

Prasetya Ajie Utama, Nafise Sadat Moosavi, Victor Sanh, Iryna Gurevych

PDF

Open Access 1 Repo

TL;DR

This paper investigates how prompt-based finetuning of language models for sentence pair classification can lead to reliance on inference heuristics like lexical overlap, and proposes regularization techniques to mitigate this issue, improving robustness.

Contribution

It identifies the problem of inference heuristics in finetuned prompt-based models and introduces a regularization method to preserve pretraining knowledge, reducing heuristic reliance.

Findings

01

Finetuned models often adopt lexical overlap heuristics.

02

Regularization helps preserve pretraining knowledge.

03

Improved performance on challenge datasets.

Abstract

Recent prompt-based approaches allow pretrained language models to achieve strong performances on few-shot finetuning by reformulating downstream tasks as a language modeling problem. In this work, we demonstrate that, despite its advantages on low data regimes, finetuned prompt-based models for sentence pair classification tasks still suffer from a common pitfall of adopting inference heuristics based on lexical overlap, e.g., models incorrectly assuming a sentence pair is of the same meaning because they consist of the same set of words. Interestingly, we find that this particular inference heuristic is significantly less present in the zero-shot evaluation of the prompt-based model, indicating how finetuning can be destructive to useful knowledge learned during the pretraining. We then show that adding a regularization that preserves pretraining weights is effective in mitigating…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ukplab/emnlp2021-prompt-ft-heuristics
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification