Zero-Shot Cross-lingual Classification Using Multilingual Neural Machine Translation
Akiko Eriguchi, Melvin Johnson, Orhan Firat, Hideto Kazawa, Wolfgang, Macherey

TL;DR
This paper introduces a multilingual encoder-classifier framework that leverages multilingual NMT systems for zero-shot cross-lingual classification, achieving competitive results across multiple benchmarks and demonstrating broad applicability of learned representations.
Contribution
The paper presents a simple yet effective method to reuse multilingual NMT encoders for zero-shot cross-lingual classification, showing significant improvements and broad applicability.
Findings
Significant improvements on English benchmark tasks
Effective zero-shot classification in unseen languages
Shared representations from NMT are broadly applicable
Abstract
Transferring representations from large supervised tasks to downstream tasks has shown promising results in AI fields such as Computer Vision and Natural Language Processing (NLP). In parallel, the recent progress in Machine Translation (MT) has enabled one to train multilingual Neural MT (NMT) systems that can translate between multiple languages and are also capable of performing zero-shot translation. However, little attention has been paid to leveraging representations learned by a multilingual NMT system to enable zero-shot multilinguality in other NLP tasks. In this paper, we demonstrate a simple framework, a multilingual Encoder-Classifier, for cross-lingual transfer learning by reusing the encoder from a multilingual NMT system and stitching it with a task-specific classifier component. Our proposed model achieves significant improvements in the English setup on three benchmark…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis
