Towards a Common Understanding of Contributing Factors for Cross-Lingual   Transfer in Multilingual Language Models: A Review

Fred Philippy; Siwen Guo; Shohreh Haddadan

arXiv:2305.16768·cs.CL·May 29, 2023·1 cites

Towards a Common Understanding of Contributing Factors for Cross-Lingual Transfer in Multilingual Language Models: A Review

Fred Philippy, Siwen Guo, Shohreh Haddadan

PDF

Open Access

TL;DR

This review paper consolidates and analyzes existing research on the factors influencing the cross-lingual transfer capabilities of multilingual language models, providing a structured overview and guidance for future work.

Contribution

It categorizes and synthesizes empirical findings on factors affecting cross-lingual transfer in MLLMs, unifying diverse research streams and resolving conflicting results.

Findings

01

Identifies five key categories of contributing factors.

02

Summarizes empirical evidence and consensus among studies.

03

Provides guidance for future research and practical application.

Abstract

In recent years, pre-trained Multilingual Language Models (MLLMs) have shown a strong ability to transfer knowledge across different languages. However, given that the aspiration for such an ability has not been explicitly incorporated in the design of the majority of MLLMs, it is challenging to obtain a unique and straightforward explanation for its emergence. In this review paper, we survey literature that investigates different factors contributing to the capacity of MLLMs to perform zero-shot cross-lingual transfer and subsequently outline and discuss these factors in detail. To enhance the structure of this review and to facilitate consolidation with future studies, we identify five categories of such factors. In addition to providing a summary of empirical evidence from past studies, we identify consensuses among studies with consistent findings and resolve conflicts among…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques