MultiSlav: Using Cross-Lingual Knowledge Transfer to Combat the Curse of Multilinguality
Artur Kot, Miko{\l}aj Koszowski, Wojciech Chojnowski, Mieszko, Rutkowski, Artur Nowakowski, Kamil Guttmann, Miko{\l}aj Pokrywka

TL;DR
This paper investigates cross-lingual transfer in multilingual NMT for Slavic languages, demonstrating benefits in low-resource and zero-shot translation, and provides open-source models to advance research in under-studied language families.
Contribution
It introduces novel approaches for leveraging cross-lingual knowledge transfer in Slavic NMT and releases state-of-the-art models for these languages.
Findings
Cross-lingual transfer improves low-resource translation quality.
Zero-shot translation benefits are demonstrated in Slavic languages.
Open-source models are provided for community use.
Abstract
Does multilingual Neural Machine Translation (NMT) lead to The Curse of the Multlinguality or provides the Cross-lingual Knowledge Transfer within a language family? In this study, we explore multiple approaches for extending the available data-regime in NMT and we prove cross-lingual benefits even in 0-shot translation regime for low-resource languages. With this paper, we provide state-of-the-art open-source NMT models for translating between selected Slavic languages. We released our models on the HuggingFace Hub (https://hf.co/collections/allegro/multislav-6793d6b6419e5963e759a683) under the CC BY 4.0 license. Slavic language family comprises morphologically rich Central and Eastern European languages. Although counting hundreds of millions of native speakers, Slavic Neural Machine Translation is under-studied in our opinion. Recently, most NMT research focuses either on:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
