Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and   Evaluation

Marius Mosbach; Tiago Pimentel; Shauli Ravfogel; Dietrich Klakow,; Yanai Elazar

arXiv:2305.16938·cs.CL·May 31, 2023·6 cites

Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and Evaluation

Marius Mosbach, Tiago Pimentel, Shauli Ravfogel, Dietrich Klakow,, Yanai Elazar

PDF

Open Access 1 Repo 1 Datasets

TL;DR

This paper compares few-shot fine-tuning and in-context learning for language models, controlling for model size and examples, and finds both approaches have similar out-of-domain generalization, with robustness still a challenge.

Contribution

It provides a fair comparison of fine-tuning and in-context learning across various model sizes, clarifying their relative generalization capabilities.

Findings

01

Both methods show similar out-of-domain generalization.

02

Model size and number of examples significantly affect performance.

03

Robust task adaptation remains a challenge for both approaches.

Abstract

Few-shot fine-tuning and in-context learning are two alternative strategies for task adaptation of pre-trained language models. Recently, in-context learning has gained popularity over fine-tuning due to its simplicity and improved out-of-domain generalization, and because extensive evidence shows that fine-tuned models pick up on spurious correlations. Unfortunately, previous comparisons of the two approaches were done using models of different sizes. This raises the question of whether the observed weaker out-of-domain generalization of fine-tuned models is an inherent property of fine-tuning or a limitation of the experimental setup. In this paper, we compare the generalization of few-shot fine-tuning and in-context learning to challenge datasets, while controlling for the models used, the number of examples, and the number of parameters, ranging from 125M to 30B. Our results show…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

uds-lsv/llmft
pytorchOfficial

Datasets

Hugodonotexit/dolma-en
dataset· 1.9k dl
1.9k dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications