Mission: Impossible Language Models
Julie Kallini, Isabel Papadimitriou, Richard Futrell, Kyle Mahowald,, Christopher Potts

TL;DR
This study tests GPT-2's ability to learn synthetic impossible languages, revealing limitations and challenging claims that LLMs can learn languages impossible for humans, thus informing linguistic and cognitive research.
Contribution
The paper introduces a systematic set of synthetic impossible languages and evaluates GPT-2's learning capacity across these, providing empirical evidence on LLMs' limitations.
Findings
GPT-2 struggles with impossible languages compared to English.
Learning impossible languages varies with language complexity.
Results challenge the claim that LLMs can learn all language types.
Abstract
Chomsky and others have very directly claimed that large language models (LLMs) are equally capable of learning languages that are possible and impossible for humans to learn. However, there is very little published experimental evidence to support such a claim. Here, we develop a set of synthetic impossible languages of differing complexity, each designed by systematically altering English data with unnatural word orders and grammar rules. These languages lie on an impossibility continuum: at one end are languages that are inherently impossible, such as random and irreversible shuffles of English words, and on the other, languages that may not be intuitively impossible but are often considered so in linguistics, particularly those with rules based on counting word positions. We report on a wide range of evaluations to assess the capacity of GPT-2 small models to learn these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗Yaning1001/Impossible_llmmodel
- 🤗mission-impossible-lms/no-shuffle-gpt2model· 93 dl93 dl
- 🤗mission-impossible-lms/nondeterministic-shuffle-gpt2model· 66 dl66 dl
- 🤗mission-impossible-lms/deterministic-shuffle-s21-gpt2model· 75 dl75 dl
- 🤗mission-impossible-lms/deterministic-shuffle-s57-gpt2model· 1 dl1 dl
- 🤗mission-impossible-lms/deterministic-shuffle-s84-gpt2model· 2 dl2 dl
- 🤗mission-impossible-lms/local-shuffle-w3-gpt2model· 123 dl123 dl
- 🤗mission-impossible-lms/local-shuffle-w5-gpt2model· 60 dl60 dl
- 🤗mission-impossible-lms/local-shuffle-w10-gpt2model· 65 dl65 dl
- 🤗mission-impossible-lms/even-odd-shuffle-gpt2model· 60 dl60 dl
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
MethodsSparse Evolutionary Training · Refunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Dropout · Adam · Cosine Annealing · Dense Connections · Linear Warmup With Cosine Annealing
