Minimax and Neyman-Pearson Meta-Learning for Outlier Languages

Edoardo Maria Ponti; Rahul Aralikatte; Disha Shrivastava; Siva Reddy,; Anders S{\o}gaard

arXiv:2106.01051·cs.CL·June 3, 2021

Minimax and Neyman-Pearson Meta-Learning for Outlier Languages

Edoardo Maria Ponti, Rahul Aralikatte, Disha Shrivastava, Siva Reddy,, Anders S{\o}gaard

PDF

1 Repo

TL;DR

This paper introduces Minimax and Neyman-Pearson meta-learning variants of MAML to improve robustness for outlier languages in cross-lingual NLP, demonstrating enhanced performance on low-resource language tasks.

Contribution

It proposes two novel meta-learning algorithms based on alternative risk criteria, addressing MAML's limitations with outlier languages in cross-lingual NLP.

Findings

01

Improved average and minimum performance on low-resource languages.

02

Effective in zero- and few-shot learning scenarios.

03

Outperforms joint transfer and vanilla MAML.

Abstract

Model-agnostic meta-learning (MAML) has been recently put forth as a strategy to learn resource-poor languages in a sample-efficient fashion. Nevertheless, the properties of these languages are often not well represented by those available during training. Hence, we argue that the i.i.d. assumption ingrained in MAML makes it ill-suited for cross-lingual NLP. In fact, under a decision-theoretic framework, MAML can be interpreted as minimising the expected risk across training languages (with a uniform prior), which is known as Bayes criterion. To increase its robustness to outlier languages, we create two variants of MAML based on alternative criteria: Minimax MAML reduces the maximum risk across languages, while Neyman-Pearson MAML constrains the risk in each language to a maximum threshold. Both criteria constitute fully differentiable two-player games. In light of this, we propose a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rahular/robust-maml
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsModel-Agnostic Meta-Learning