X-METRA-ADA: Cross-lingual Meta-Transfer Learning Adaptation to Natural Language Understanding and Question Answering
Meryem M'hamdi, Doo Soon Kim, Franck Dernoncourt, Trung Bui, Xiang, Ren, and Jonathan May

TL;DR
X-METRA-ADA introduces a meta-learning based adaptation method that improves cross-lingual transfer for natural language understanding tasks, especially in low-resource and diverse language scenarios.
Contribution
It adapts MAML for cross-lingual NLU, demonstrating improved performance over naive fine-tuning on multilingual dialog and question answering tasks.
Findings
Outperforms naive fine-tuning on multiple languages
Leverages limited data for faster adaptation
Achieves competitive results on diverse languages
Abstract
Multilingual models, such as M-BERT and XLM-R, have gained increasing popularity, due to their zero-shot cross-lingual transfer learning capabilities. However, their generalization ability is still inconsistent for typologically diverse languages and across different benchmarks. Recently, meta-learning has garnered attention as a promising technique for enhancing transfer learning under low-resource scenarios: particularly for cross-lingual transfer in Natural Language Understanding (NLU). In this work, we propose X-METRA-ADA, a cross-lingual MEta-TRAnsfer learning ADAptation approach for NLU. Our approach adapts MAML, an optimization-based meta-learning approach, to learn to adapt to new languages. We extensively evaluate our framework on two challenging cross-lingual NLU tasks: multilingual task-oriented dialog and typologically diverse question answering. We show that our approach…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsXLM-R · Model-Agnostic Meta-Learning
