LCS: A Language Converter Strategy for Zero-Shot Neural Machine   Translation

Zengkui Sun; Yijin Liu; Fandong Meng; Jinan Xu; Yufeng Chen; Jie Zhou

arXiv:2406.02876·cs.CL·June 7, 2024·1 cites

LCS: A Language Converter Strategy for Zero-Shot Neural Machine Translation

Zengkui Sun, Yijin Liu, Fandong Meng, Jinan Xu, Yufeng Chen, Jie Zhou

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces the Language Converter Strategy (LCS), a novel approach that improves zero-shot neural machine translation by stabilizing language indication and reducing off-target errors across multiple datasets.

Contribution

LCS effectively mitigates the off-target issue in zero-shot translation by embedding target language information into encoder layers, outperforming traditional language tag strategies.

Findings

01

LCS achieves up to 95.28% language accuracy.

02

LCS improves BLEU scores by up to 7.93 points.

03

LCS outperforms vanilla language tag strategies on multiple datasets.

Abstract

Multilingual neural machine translation models generally distinguish translation directions by the language tag (LT) in front of the source or target sentences. However, current LT strategies cannot indicate the desired target language as expected on zero-shot translation, i.e., the off-target issue. Our analysis reveals that the indication of the target language is sensitive to the placement of the target LT. For example, when placing the target LT on the decoder side, the indication would rapidly degrade along with decoding steps, while placing the target LT on the encoder side would lead to copying or paraphrasing the source input. To address the above issues, we propose a simple yet effective strategy named Language Converter Strategy (LCS). By introducing the target language embedding into the top encoder layers, LCS mitigates confusion in the encoder and ensures stable language…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

acerkoo/lcs
pytorchOfficial

Videos

LCS: A Language Converter Strategy for Zero-Shot Neural Machine Translation· underline

Taxonomy

TopicsNatural Language Processing Techniques