ACT-MNMT Auto-Constriction Turning for Multilingual Neural Machine   Translation

Shaojie Dai; Xin Liu; Ping Luo; Yue Yu

arXiv:2403.06745·cs.CL·March 12, 2024·1 cites

ACT-MNMT Auto-Constriction Turning for Multilingual Neural Machine Translation

Shaojie Dai, Xin Liu, Ping Luo, Yue Yu

PDF

Open Access

TL;DR

This paper introduces ACT-MNMT, a supervised fine-tuning method that constructs constrained templates with trigger tokens to improve multilingual translation accuracy and reduce off-target issues in LLM-based models.

Contribution

It proposes a novel Auto-Constriction Turning mechanism that enhances multilingual NMT by automatically creating constrained templates with trigger tokens, orthogonal to prompt-based methods.

Findings

01

Significantly improves translation performance across multiple directions.

02

Reduces off-target translation phenomena.

03

Demonstrates effectiveness on WMT test sets.

Abstract

Large language model (LLM) has achieved promising performance in multilingual machine translation tasks through zero/few-shot prompts or prompt-tuning. However, due to the mixture of multilingual data during the pre-training of LLM, the LLM-based translation models face the off-target issue in both prompt-based methods, including a series of phenomena, namely instruction misunderstanding, translation with wrong language and over-generation. For this issue, this paper introduces an \textbf{\underline{A}}uto-\textbf{\underline{C}}onstriction \textbf{\underline{T}}urning mechanism for \textbf{\underline{M}}ultilingual \textbf{\underline{N}}eural \textbf{\underline{M}}achine \textbf{\underline{T}}ranslation (\model), which is a novel supervised fine-tuning mechanism and orthogonal to the traditional prompt-based methods. In this method, \model automatically constructs a constrained template…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques