Can Language Models Learn Typologically Implausible Languages?

Tianyang Xu; Tatsuki Kuribayashi; Yohei Oseki; Ryan Cotterell; Alex Warstadt

arXiv:2502.12317·cs.CL·February 19, 2025

TL;DR

This study investigates whether language models (LMs) can learn typologically implausible languages. The LMs show some preferences aligned with natural-language patterns, which suggests that domain-general biases contribute to language universals.

Contribution

The paper provides the first large-scale, naturalistic assessment of LMs learning plausible and implausible languages, highlighting their biases and learning dynamics in typologically diverse settings.

Findings

01. LMs are slower to learn implausible languages.

02. LMs achieve similar performance on some metrics regardless of plausibility.

03. Results support the role of domain-general biases in language learning.

Abstract

Grammatical features across human languages show intriguing correlations often attributed to learning biases in humans. However, empirical evidence has been limited to experiments with highly simplified artificial languages, and whether these correlations arise from domain-general or language-specific biases remains a matter of debate. Language models (LMs) provide an opportunity to study artificial language learning at a large scale and with a high degree of naturalism. In this paper, we begin with an in-depth discussion of how LMs allow us to better determine the role of domain-general learning biases in language universals. We then assess learnability differences for LMs resulting from typologically plausible and implausible languages closely following the word-order universals identified by linguistic typologists. We conduct a symmetrical cross-lingual study training and testing LMs…

Taxonomy

Topics: Natural Language Processing Techniques