Efficient Test Time Adapter Ensembling for Low-resource Language Varieties
Xinyi Wang, Yulia Tsvetkov, Sebastian Ruder, Graham Neubig

TL;DR
This paper introduces EMEA, a method that ensembles multiple language adapters with optimized weights to improve robustness and performance on low-resource and unseen language varieties in NLP tasks.
Contribution
The paper proposes EMEA, a novel ensemble method that optimizes adapter weights per test sentence to enhance multilingual model robustness without additional training.
Findings
EMEA significantly improves NER and POS tagging across diverse language varieties.
Ensembling adapters increases robustness to unseen languages.
Optimizing ensemble weights per sentence yields better results than fixed weights.
Abstract
Adapters are light-weight modules that allow parameter-efficient fine-tuning of pretrained models. Specialized language and task adapters have recently been proposed to facilitate cross-lingual transfer of multilingual pretrained models (Pfeiffer et al., 2020b). However, this approach requires training a separate language adapter for every language one wishes to support, which can be impractical for languages with limited data. An intuitive solution is to use a related language adapter for the new language variety, but we observe that this solution can lead to sub-optimal performance. In this paper, we aim to improve the robustness of language adapters to uncovered languages without training new adapters. We find that ensembling multiple existing language adapters makes the fine-tuned model significantly more robust to other language varieties not included in these adapters. Building…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsEntropy Minimized Ensemble of Adapters · Adapter
