Phylogeny-Inspired Adaptation of Multilingual Models to New Languages

Fahim Faisal; Antonios Anastasopoulos

arXiv:2205.09634·cs.CL·November 24, 2022·6 cites

Phylogeny-Inspired Adaptation of Multilingual Models to New Languages

Fahim Faisal, Antonios Anastasopoulos

PDF

Open Access 1 Repo

TL;DR

This paper introduces a phylogeny-inspired method for adapting multilingual models to new languages, leveraging linguistic relationships to improve transfer learning, especially for unseen languages, resulting in significant performance gains.

Contribution

It proposes a novel approach using language phylogenetic information to enhance cross-lingual transfer in multilingual models through adapter-based training.

Findings

01

Over 20% relative performance improvement on unseen languages.

02

Effective adaptation across diverse language families.

03

Enhanced syntactic and semantic task performance.

Abstract

Large pretrained multilingual models, trained on dozens of languages, have delivered promising results due to cross-lingual learning capabilities on variety of language tasks. Further adapting these models to specific languages, especially ones unseen during pre-training, is an important goal towards expanding the coverage of language technologies. In this study, we show how we can use language phylogenetic information to improve cross-lingual transfer leveraging closely related languages in a structured, linguistically-informed manner. We perform adapter-based training on languages from diverse language families (Germanic, Uralic, Tupian, Uto-Aztecan) and evaluate on both syntactic and semantic tasks, obtaining more than 20% relative performance improvements over strong commonly used baselines, especially on languages unseen during pre-training.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ffaisal93/adapt_lang_phylogeny
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Language and cultural evolution