Prompt-Tuning Can Be Much Better Than Fine-Tuning on Cross-lingual Understanding With Multilingual Language Models

Lifu Tu; Caiming Xiong; Yingbo Zhou

arXiv:2210.12360·cs.CL·December 14, 2022

TL;DR

This paper shows that prompt tuning achieves much better zero-shot cross-lingual transfer than fine-tuning for multilingual language models across NLU tasks, while tuning only 0.1% to 0.3% of the parameters.

Contribution

It evaluates prompt tuning as an alternative to fine-tuning for zero-shot cross-lingual transfer, comparing the two on sentence classification, sequence labeling, and question answering across a variety of target languages.

Findings

01

Prompt tuning outperforms fine-tuning in cross-lingual transfer.

02

Only 0.1% to 0.3% of the parameters are tuned in prompt tuning.

03

Prompt tuning achieves better aligned decision boundaries.

Abstract

Pre-trained multilingual language models show significant performance gains for zero-shot cross-lingual model transfer on a wide range of natural language understanding (NLU) tasks. In previous work on zero-shot cross-lingual evaluation, pre-trained models have been fine-tuned only on English data and then tested on a variety of target languages. In this paper, we perform cross-lingual evaluation on various NLU tasks (sentence classification, sequence labeling, question answering) using prompt tuning and compare it with fine-tuning. The results show that prompt tuning achieves much better cross-lingual transfer than fine-tuning across datasets, with only 0.1% to 0.3% tuned parameters. Additionally, we demonstrate through analysis that prompt tuning can have better cross-lingual transferability of representations on downstream tasks, with better-aligned decision boundaries.
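To give a feel for how a tuned-parameter share in the 0.1% to 0.3% range can arise, here is a minimal arithmetic sketch. It assumes a frozen backbone with trainable prompt vectors injected at every layer. The function name and all numbers (backbone size, hidden size, layer count, prompt lengths, two vectors per layer position) are illustrative assumptions and are not taken from the paper; any task-specific output head is ignored.

```python
def prompt_param_fraction(backbone_params, prompt_len, hidden_size,
                          num_layers, vectors_per_layer=2):
    """Share of parameters trained when only prompt vectors are updated.

    The fraction is measured relative to the frozen backbone size.
    vectors_per_layer=2 assumes one key and one value vector per prompt
    position in each layer (an assumption, not the paper's stated setup).
    """
    prompt_params = prompt_len * hidden_size * num_layers * vectors_per_layer
    return prompt_params / backbone_params


# Hypothetical configuration: a backbone of roughly 560 million parameters
# with 24 layers and hidden size 1024 (assumed, approximate figures).
BACKBONE_PARAMS = 560_000_000
HIDDEN_SIZE = 1024
NUM_LAYERS = 24

for prompt_len in (16, 32):
    fraction = prompt_param_fraction(
        BACKBONE_PARAMS, prompt_len, HIDDEN_SIZE, NUM_LAYERS
    )
    print(f"prompt length {prompt_len}: {fraction:.2%} of parameters tuned")
```

Under these assumed values, both prompt lengths land inside the 0.1% to 0.3% range quoted in the abstract; fine-tuning, by contrast, updates every backbone parameter.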

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications