Unsupervised Cross-lingual Adaptation for Sequence Tagging and Beyond

Xin Li; Lidong Bing; Wenxuan Zhang; Zheng Li; Wai Lam

arXiv:2010.12405·cs.CL·June 23, 2021·25 cites

Unsupervised Cross-lingual Adaptation for Sequence Tagging and Beyond

Xin Li, Lidong Bing, Wenxuan Zhang, Zheng Li, Wai Lam

PDF

Open Access

TL;DR

This paper introduces a novel warm-up based framework that combines zero-shot and translation-based methods to improve cross-lingual sequence tagging performance using multilingual pre-trained models.

Contribution

It proposes a tailored warm-up mechanism and a refined adaptation approach that enhances cross-lingual transfer for sequence tagging tasks.

Findings

01

Improved performance on nine target languages.

02

Effective combination of zero-shot and translation-based approaches.

03

Beneficial for various sequence tagging tasks.

Abstract

Cross-lingual adaptation with multilingual pre-trained language models (mPTLMs) mainly consists of two lines of works: zero-shot approach and translation-based approach, which have been studied extensively on the sequence-level tasks. We further verify the efficacy of these cross-lingual adaptation approaches by evaluating their performances on more fine-grained sequence tagging tasks. After re-examining their strengths and drawbacks, we propose a novel framework to consolidate the zero-shot approach and the translation-based approach for better adaptation performance. Instead of simply augmenting the source data with the machine-translated data, we tailor-make a warm-up mechanism to quickly update the mPTLMs with the gradients estimated on a few translated data. Then, the adaptation approach is applied to the refined parameters and the cross-lingual transfer is performed in a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications