The Impact of Language Adapters in Cross-Lingual Transfer for NLU
Jenny Kunz, Oskar Holmstr\"om

TL;DR
This study investigates the role of language adapters in zero-shot cross-lingual transfer for NLU, revealing that their impact varies and sometimes retaining source-language adapters yields better results.
Contribution
It provides detailed ablation studies on the effectiveness of target-language adapters in multilingual models for NLU tasks.
Findings
Target-language adapters have inconsistent effects across tasks and languages.
Retaining source-language adapters often matches or exceeds target-language adapter performance.
Removing language adapters after training has minimal negative impact.
Abstract
Modular deep learning has been proposed for the efficient adaption of pre-trained models to new tasks, domains and languages. In particular, combining language adapters with task adapters has shown potential where no supervised data exists for a language. In this paper, we explore the role of language adapters in zero-shot cross-lingual transfer for natural language understanding (NLU) benchmarks. We study the effect of including a target-language adapter in detailed ablation studies with two multilingual models and three multilingual datasets. Our results show that the effect of target-language adapters is highly inconsistent across tasks, languages and models. Retaining the source-language adapter instead often leads to an equivalent, and sometimes to a better, performance. Removing the language adapter after training has only a weak negative effect, indicating that the language…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems
MethodsAdapter
