Language-Informed Beam Search Decoding for Multilingual Machine Translation
Yilin Yang, Stefan Lee, Prasad Tadepalli

TL;DR
This paper introduces Language-informed Beam Search (LiBS), a decoding method that integrates language identification to significantly reduce off-target translations in multilingual NMT models, improving translation accuracy without extra data.
Contribution
The paper proposes LiBS, a novel decoding algorithm that incorporates language identification into beam search, effectively reducing off-target outputs in multilingual neural machine translation.
Findings
LiBS reduces off-target translation rates from 22.9% to 7.7%.
LiBS improves BLEU scores by +1.1 and +0.9 on WMT and OPUS datasets.
LiBS is model-agnostic and requires no additional parallel data.
Abstract
Beam search decoding is the de-facto method for decoding auto-regressive Neural Machine Translation (NMT) models, including multilingual NMT where the target language is specified as an input. However, decoding multilingual NMT models commonly produces ``off-target'' translations -- yielding translation outputs not in the intended language. In this paper, we first conduct an error analysis of off-target translations for a strong multilingual NMT model and identify how these decodings are produced during beam search. We then propose Language-informed Beam Search (LiBS), a general decoding algorithm incorporating an off-the-shelf Language Identification (LiD) model into beam search decoding to reduce off-target translations. LiBS is an inference-time procedure that is NMT-model agnostic and does not require any additional parallel data. Results show that our proposed LiBS algorithm on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
