Few-shot Adaptation Works with UnpredicTable Data
Jun Shern Chan, Michael Pieler, Jonathan Jao, J\'er\'emy Scheurer,, Ethan Perez

TL;DR
This paper demonstrates that fine-tuning language models on large, narrow datasets extracted from internet tables can significantly improve few-shot learning performance across diverse NLP tasks, often outperforming broader datasets.
Contribution
The study introduces a massive dataset of over 413,000 tasks from internet tables and shows that narrow, specific datasets can enhance FSL more effectively than larger, diverse datasets.
Findings
Fine-tuning on narrow datasets like software documentation improves FSL by +7.5%.
Narrow datasets can outperform broader datasets in FSL performance.
Gains from datasets are not solely due to domain similarity, indicating other factors at play.
Abstract
Prior work on language models (LMs) shows that training on a large number of diverse tasks improves few-shot learning (FSL) performance on new tasks. We take this to the extreme, automatically extracting 413,299 tasks from internet tables - orders of magnitude more than the next-largest public datasets. Finetuning on the resulting dataset leads to improved FSL performance on Natural Language Processing (NLP) tasks, but not proportionally to dataset scale. In fact, we find that narrow subsets of our dataset sometimes outperform more diverse datasets. For example, finetuning on software documentation from support.google.com raises FSL performance by a mean of +7.5% on 52 downstream tasks, which beats training on 40 human-curated NLP datasets (+6.7%). Finetuning on various narrow datasets leads to similar broad improvements across test tasks, suggesting that the gains are not from domain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
MethodsTest
