Quantifying Adaptability in Pre-trained Language Models with 500 Tasks
Belinda Z. Li, Jane Yu, Madian Khabsa, Luke Zettlemoyer, Alon Halevy,, Jacob Andreas

TL;DR
This paper introduces TaskBench500, a comprehensive benchmark to empirically analyze the adaptability of pre-trained language models across 500 diverse tasks, revealing key factors influencing their performance and limitations.
Contribution
The paper presents a large-scale empirical study using a new benchmark, TaskBench500, to systematically evaluate and understand the factors affecting language model adaptability to new tasks.
Findings
Adaptation procedures vary greatly in memorization ability.
Some tasks show compositional adaptability among models.
Mismatch in label distribution difficulty explains adaptation failures.
Abstract
When a neural language model (LM) is adapted to perform a new task, what aspects of the task predict the eventual performance of the model? In NLP, systematic features of LM generalization to individual examples are well characterized, but systematic aspects of LM adaptability to new tasks are not nearly as well understood. We present a large-scale empirical study of the features and limits of LM adaptability using a new benchmark, TaskBench500, built from 500 procedurally generated sequence modeling tasks. These tasks combine core aspects of language processing, including lexical semantics, sequence processing, memorization, logical reasoning, and world knowledge. Using TaskBench500, we evaluate three facets of adaptability, finding that: (1) adaptation procedures differ dramatically in their ability to memorize small datasets; (2) within a subset of task types, adaptation procedures…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
