The Impact of Language Adapters in Cross-Lingual Transfer for NLU

Jenny Kunz; Oskar Holmstr\"om

arXiv:2402.00149·cs.CL·December 18, 2024·1 cites

The Impact of Language Adapters in Cross-Lingual Transfer for NLU

Jenny Kunz, Oskar Holmstr\"om

PDF

Open Access 1 Repo

TL;DR

This study investigates the role of language adapters in zero-shot cross-lingual transfer for NLU, revealing that their impact varies and sometimes retaining source-language adapters yields better results.

Contribution

It provides detailed ablation studies on the effectiveness of target-language adapters in multilingual models for NLU tasks.

Findings

01

Target-language adapters have inconsistent effects across tasks and languages.

02

Retaining source-language adapters often matches or exceeds target-language adapter performance.

03

Removing language adapters after training has minimal negative impact.

Abstract

Modular deep learning has been proposed for the efficient adaption of pre-trained models to new tasks, domains and languages. In particular, combining language adapters with task adapters has shown potential where no supervised data exists for a language. In this paper, we explore the role of language adapters in zero-shot cross-lingual transfer for natural language understanding (NLU) benchmarks. We study the effect of including a target-language adapter in detailed ablation studies with two multilingual models and three multilingual datasets. Our results show that the effect of target-language adapters is highly inconsistent across tasks, languages and models. Retaining the source-language adapter instead often leads to an equivalent, and sometimes to a better, performance. Removing the language adapter after training has only a weak negative effect, indicating that the language…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

oskarholmstrom/lang-adapters-impact
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems

MethodsAdapter