Flexing in 73 Languages: A Single Small Model for Multilingual Inflection

Tom\'a\v{s} Sourada; Jana Strakov\'a

arXiv:2510.23114·cs.CL·October 28, 2025

Flexing in 73 Languages: A Single Small Model for Multilingual Inflection

Tom\'a\v{s} Sourada, Jana Strakov\'a

PDF

TL;DR

This paper introduces a compact, multilingual inflection model trained on 73 languages that outperforms monolingual models, demonstrating the benefits of a single, robust system for morphological inflection across diverse languages.

Contribution

The authors develop a lightweight, single-model approach for multilingual inflection that handles unseen words and simplifies deployment, outperforming monolingual baselines across many languages.

Findings

01

Outperforms monolingual models in most languages

02

Robust to unseen words across languages

03

Simplifies deployment by using a single model

Abstract

We present a compact, single-model approach to multilingual inflection, the task of generating inflected word forms from base lemmas to express grammatical categories. Our model, trained jointly on data from 73 languages, is lightweight, robust to unseen words, and outperforms monolingual baselines in most languages. This demonstrates the effectiveness of multilingual modeling for inflection and highlights its practical benefits: simplifying deployment by eliminating the need to manage and retrain dozens of separate monolingual models. In addition to the standard SIGMORPHON shared task benchmarks, we evaluate our monolingual and multilingual models on 73 Universal Dependencies (UD) treebanks, extracting lemma-tag-form triples and their frequency counts. To ensure realistic data splits, we introduce a novel frequency-weighted, lemma-disjoint train-dev-test resampling procedure. Our work…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.