Can Language Models Learn Typologically Implausible Languages?
Tianyang Xu, Tatsuki Kuribayashi, Yohei Oseki, Ryan Cotterell, Alex, Warstadt

TL;DR
This study investigates whether language models can learn typologically implausible languages, revealing that they show some preferences aligned with natural language patterns, suggesting domain-general biases influence language universals.
Contribution
The paper provides the first large-scale, naturalistic assessment of LMs learning plausible and implausible languages, highlighting their biases and learning dynamics in typologically diverse settings.
Findings
LMs are slower to learn implausible languages.
LMs achieve similar performance on some metrics regardless of plausibility.
Results support the role of domain-general biases in language learning.
Abstract
Grammatical features across human languages show intriguing correlations often attributed to learning biases in humans. However, empirical evidence has been limited to experiments with highly simplified artificial languages, and whether these correlations arise from domain-general or language-specific biases remains a matter of debate. Language models (LMs) provide an opportunity to study artificial language learning at a large scale and with a high degree of naturalism. In this paper, we begin with an in-depth discussion of how LMs allow us to better determine the role of domain-general learning biases in language universals. We then assess learnability differences for LMs resulting from typologically plausible and implausible languages closely following the word-order universals identified by linguistic typologists. We conduct a symmetrical cross-lingual study training and testing LMs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
