Languages Transferred Within the Encoder: On Representation Transfer in Zero-Shot Multilingual Translation
Zhi Qu, Chenchen Ding, Taro Watanabe

TL;DR
This paper investigates how multilingual neural machine translation models transfer language representations within the encoder, revealing that transfer is language-specific rather than language-agnostic, and proposes methods to improve zero-shot translation performance.
Contribution
It introduces the identity pair concept for analysis and demonstrates that encoder transfer is language-specific, proposing two methods to enhance zero-shot translation.
Findings
Encoder transfers source language to target language subspace.
Zero-shot deficiency is due to entangled representations across languages.
Proposed methods significantly improve zero-shot translation performance.
Abstract
Understanding representation transfer in multilingual neural machine translation (MNMT) can reveal the reason for the zero-shot translation deficiency. In this work, we systematically analyze the representational issue of MNMT models. We first introduce the identity pair, translating a sentence to itself, to address the lack of the base measure in multilingual investigations, as the identity pair can reflect the representation of a language within the model. Then, we demonstrate that the encoder transfers the source language to the representational subspace of the target language instead of the language-agnostic state. Thus, the zero-shot translation deficiency arises because the representation of a translation is entangled with other languages and not transferred to the target language effectively. Based on our findings, we propose two methods: 1) low-rank language-specific embedding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
MethodsBalanced Selection · Contrastive Learning
