LCS: A Language Converter Strategy for Zero-Shot Neural Machine Translation
Zengkui Sun, Yijin Liu, Fandong Meng, Jinan Xu, Yufeng Chen, Jie Zhou

TL;DR
This paper introduces the Language Converter Strategy (LCS), a novel approach that improves zero-shot neural machine translation by stabilizing language indication and reducing off-target errors across multiple datasets.
Contribution
LCS effectively mitigates the off-target issue in zero-shot translation by embedding target language information into encoder layers, outperforming traditional language tag strategies.
Findings
LCS achieves up to 95.28% language accuracy.
LCS improves BLEU scores by up to 7.93 points.
LCS outperforms vanilla language tag strategies on multiple datasets.
Abstract
Multilingual neural machine translation models generally distinguish translation directions by the language tag (LT) in front of the source or target sentences. However, current LT strategies cannot indicate the desired target language as expected on zero-shot translation, i.e., the off-target issue. Our analysis reveals that the indication of the target language is sensitive to the placement of the target LT. For example, when placing the target LT on the decoder side, the indication would rapidly degrade along with decoding steps, while placing the target LT on the encoder side would lead to copying or paraphrasing the source input. To address the above issues, we propose a simple yet effective strategy named Language Converter Strategy (LCS). By introducing the target language embedding into the top encoder layers, LCS mitigates confusion in the encoder and ensures stable language…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNatural Language Processing Techniques
