Avoiding Inference Heuristics in Few-shot Prompt-based Finetuning
Prasetya Ajie Utama, Nafise Sadat Moosavi, Victor Sanh, Iryna Gurevych

TL;DR
This paper investigates how prompt-based finetuning of language models for sentence pair classification can lead to reliance on inference heuristics like lexical overlap, and proposes regularization techniques to mitigate this issue, improving robustness.
Contribution
It identifies the problem of inference heuristics in finetuned prompt-based models and introduces a regularization method to preserve pretraining knowledge, reducing heuristic reliance.
Findings
Finetuned models often adopt lexical overlap heuristics.
Regularization helps preserve pretraining knowledge.
Improved performance on challenge datasets.
Abstract
Recent prompt-based approaches allow pretrained language models to achieve strong performances on few-shot finetuning by reformulating downstream tasks as a language modeling problem. In this work, we demonstrate that, despite its advantages on low data regimes, finetuned prompt-based models for sentence pair classification tasks still suffer from a common pitfall of adopting inference heuristics based on lexical overlap, e.g., models incorrectly assuming a sentence pair is of the same meaning because they consist of the same set of words. Interestingly, we find that this particular inference heuristic is significantly less present in the zero-shot evaluation of the prompt-based model, indicating how finetuning can be destructive to useful knowledge learned during the pretraining. We then show that adding a regularization that preserves pretraining weights is effective in mitigating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
