Boosting Cross-Lingual Transfer via Self-Learning with Uncertainty   Estimation

Liyan Xu; Xuchao Zhang; Xujiang Zhao; Haifeng Chen; Feng Chen; Jinho; D. Choi

arXiv:2109.00194·cs.CL·September 24, 2021

Boosting Cross-Lingual Transfer via Self-Learning with Uncertainty Estimation

Liyan Xu, Xuchao Zhang, Xujiang Zhao, Haifeng Chen, Feng Chen, Jinho, D. Choi

PDF

Open Access 1 Repo

TL;DR

This paper introduces a self-learning approach with uncertainty estimation to improve cross-lingual transfer in multilingual models, significantly boosting performance on NER and NLI tasks across 40 languages.

Contribution

It proposes a novel self-learning framework utilizing uncertainty estimation to select high-quality pseudo-labels for better cross-lingual transfer.

Findings

01

Outperforms baselines by 10 F1 on NER

02

Achieves 2.5 higher accuracy on NLI

03

Effective across 40 languages

Abstract

Recent multilingual pre-trained language models have achieved remarkable zero-shot performance, where the model is only finetuned on one source language and directly evaluated on target languages. In this work, we propose a self-learning framework that further utilizes unlabeled data of target languages, combined with uncertainty estimation in the process to select high-quality silver labels. Three different uncertainties are adapted and analyzed specifically for the cross lingual transfer: Language Heteroscedastic/Homoscedastic Uncertainty (LEU/LOU), Evidential Uncertainty (EVI). We evaluate our framework with uncertainties on two cross-lingual tasks including Named Entity Recognition (NER) and Natural Language Inference (NLI) covering 40 languages in total, which outperforms the baselines significantly by 10 F1 on average for NER and 2.5 accuracy score for NLI.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lxucs/multilingual-sl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsSelf-Learning