Evaluating Morphological Compositional Generalization in Large Language Models

Mete Ismayilzada; Defne Circi; Jonne S\"alev\"a; Hale Sirin; Abdullatif K\"oksal; Bhuwan Dhingra; Antoine Bosselut; Duygu Ataman; Lonneke van der Plas

arXiv:2410.12656·cs.CL·June 6, 2025

Evaluating Morphological Compositional Generalization in Large Language Models

Mete Ismayilzada, Defne Circi, Jonne S\"alev\"a, Hale Sirin, Abdullatif K\"oksal, Bhuwan Dhingra, Antoine Bosselut, Duygu Ataman, Lonneke van der Plas

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper systematically evaluates how well large language models understand and generalize morphological structures, revealing significant limitations in their ability to handle novel words and complex morphology in agglutinative languages.

Contribution

The study introduces a novel suite of tasks to assess morphological compositionality in LLMs, highlighting their struggles with systematic generalization in agglutinative languages.

Findings

01

LLMs perform poorly on morphological generalization with novel roots

02

Models show limited systematicity in morphological composition

03

Performance declines with increased morphological complexity

Abstract

Large language models (LLMs) have demonstrated significant progress in various natural language generation and understanding tasks. However, their linguistic generalization capabilities remain questionable, raising doubts about whether these models learn language similarly to humans. While humans exhibit compositional generalization and linguistic creativity in language use, the extent to which LLMs replicate these abilities, particularly in morphology, is under-explored. In this work, we systematically investigate the morphological generalization abilities of LLMs through the lens of compositionality. We define morphemes as compositional primitives and design a novel suite of generative and discriminative tasks to assess morphological productivity and systematicity. Focusing on agglutinative languages such as Turkish and Finnish, we evaluate several state-of-the-art…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

selimfirat/bilkent-turkish-writings-dataset
noneOfficial

Videos

Evaluating Morphological Compositional Generalization in Large Language Models· underline

Taxonomy

TopicsNatural Language Processing Techniques

MethodsAttention Is All You Need · Adam · Dropout · Dense Connections · Layer Normalization · Residual Connection · Position-Wise Feed-Forward Layer · Linear Layer · Byte Pair Encoding · Absolute Position Encodings