Language acquisition: do children and language models follow similar learning stages?
Linnea Evanson, Yair Lakretz, Jean-R\'emi King

TL;DR
This study compares the language learning stages of children and GPT-2 models, revealing similarities in systematic learning order and parallel development, but also highlighting key differences in their acquisition processes.
Contribution
The paper provides the first detailed comparison of language acquisition stages between children and deep language models, revealing both parallels and divergences.
Findings
Language models learn linguistic skills in a systematic order similar to children.
Learning tasks that are learned last improve from early training stages.
Some learning stages are shared between children and models, others are not.
Abstract
During language acquisition, children follow a typical sequence of learning stages, whereby they first learn to categorize phonemes before they develop their lexicon and eventually master increasingly complex syntactic structures. However, the computational principles that lead to this learning trajectory remain largely unknown. To investigate this, we here compare the learning trajectories of deep language models to those of children. Specifically, we test whether, during its training, GPT-2 exhibits stages of language acquisition comparable to those observed in children aged between 18 months and 6 years. For this, we train 48 GPT-2 models from scratch and evaluate their syntactic and semantic abilities at each training step, using 96 probes curated from the BLiMP, Zorro and BIG-Bench benchmarks. We then compare these evaluations with the behavior of 54 children during language…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Cosine Annealing · Linear Layer · Attention Dropout · Layer Normalization · Byte Pair Encoding · Test · Softmax · Dense Connections
