Community size rather than grammatical complexity better predicts Large Language Model accuracy in a novel Wug Test

Nikoleta Pantelidou; Evelina Leivada; Raquel Montero; Paolo Morosi; Wei Lun Wong

PMC · DOI: 10.1371/journal.pone.0343164 · March 11, 2026

Open Access

TL;DR

This study finds that large language model accuracy on a morphological generalization task with novel words tracks the size of a language's speaker community more closely than the language's grammatical complexity.

Contribution

The study introduces a multilingual Wug Test to compare the performance of six models with that of human speakers across four languages (Catalan, English, Greek, and Spanish).

Findings

1. Model accuracy in morphological generalization aligns more with community size and data availability than with grammatical complexity.

2. Languages with larger speaker communities, such as Spanish and English, showed higher model accuracy.

3. Model behavior resembles human linguistic competence superficially but is driven by resource richness.

Abstract

The linguistic abilities of Large Language Models are a matter of ongoing debate. This study contributes to this discussion by investigating model performance in a morphological generalization task that involves novel words. Using a multilingual adaptation of the Wug Test, six models were tested across four partially unrelated languages (Catalan, English, Greek, and Spanish) and compared with human speakers. The aim is to determine whether model accuracy approximates human competence and whether it is shaped primarily by linguistic complexity or by the size of the linguistic community, which affects the quantity of available training data. Consistent with previous research, the results show that the models are able to generalize morphological processes to unseen words with human-like accuracy. However, accuracy patterns align more closely with community size and data availability than…

Taxonomy

Topics: Neurobiology of Language and Bilingualism · Language Development and Disorders · Phonetics and Phonology Research