HLT-MT: High-resource Language-specific Training for Multilingual Neural   Machine Translation

Jian Yang; Yuwei Yin; Shuming Ma; Dongdong Zhang; Zhoujun Li; Furu Wei

arXiv:2207.04906·cs.CL·July 21, 2022

HLT-MT: High-resource Language-specific Training for Multilingual Neural Machine Translation

Jian Yang, Yuwei Yin, Shuming Ma, Dongdong Zhang, Zhoujun Li, Furu Wei

PDF

1 Repo

TL;DR

This paper introduces HLT-MT, a two-stage training approach with language-specific modules to improve multilingual neural machine translation, especially reducing negative interference among languages, leading to better performance on benchmarks.

Contribution

The paper proposes a novel high-resource language-specific training method with a two-stage process and language-specific modules to enhance multilingual translation quality.

Findings

01

Outperforms strong baselines on WMT-10 and OPUS-100 benchmarks.

02

Effectively mitigates negative interference in multilingual training.

03

Improves translation quality for high-resource and low-resource languages.

Abstract

Multilingual neural machine translation (MNMT) trained in multiple language pairs has attracted considerable attention due to fewer model parameters and lower training costs by sharing knowledge among multiple languages. Nonetheless, multilingual training is plagued by language interference degeneration in shared parameters because of the negative interference among different translation directions, especially on high-resource languages. In this paper, we propose the multilingual translation model with the high-resource language-specific training (HLT-MT) to alleviate the negative interference, which adopts the two-stage training with the language-specific selection mechanism. Specifically, we first train the multilingual model only with the high-resource pairs and select the language-specific modules at the top of the decoder to enhance the translation quality of high-resource…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

YuweiYin/HLT-MT
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.