Unpacking Let Alone: Human-Scale Models Generalize to a Rare Construction in Form but not Meaning
Wesley Scivetti, Tatsuya Aoyama, Ethan Wilcox, Nathan Schneider

TL;DR
This paper investigates whether human-scale language models can generalize knowledge of rare grammatical constructions, specifically LET-ALONE, in both form and meaning, revealing they understand form but not meaning, unlike humans.
Contribution
It introduces a synthetic benchmark to evaluate language models' understanding of rare constructions' form and meaning, highlighting an asymmetry in their generalization abilities.
Findings
Models recognize form but not meaning of LET-ALONE
Models generalize form from frequent to rare constructions
Humans understand both form and meaning of rare constructions
Abstract
Humans have a remarkable ability to acquire and understand grammatical phenomena that are seen rarely, if ever, during childhood. Recent evidence suggests that language models with human-scale pretraining data may possess a similar ability by generalizing from frequent to rare constructions. However, it remains an open question how widespread this generalization ability is, and to what extent this knowledge extends to meanings of rare constructions, as opposed to just their forms. We fill this gap by testing human-scale transformer language models on their knowledge of both the form and meaning of the (rare and quirky) English LET-ALONE construction. To evaluate our LMs we construct a bespoke synthetic benchmark that targets syntactic and semantic properties of the construction. We find that human-scale LMs are sensitive to form, even when related constructions are filtered from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsLanguage Development and Disorders · Language and cultural evolution · Topic Modeling
