Complete Multilingual Neural Machine Translation

Markus Freitag; Orhan Firat

arXiv:2010.10239·cs.CL·October 21, 2020

Complete Multilingual Neural Machine Translation

Markus Freitag, Orhan Firat

PDF

TL;DR

This paper introduces complete Multilingual Neural Machine Translation (cMNMT), leveraging multi-way aligned data to connect all language pairs, improving translation quality and scalability across 12,432 language pairs.

Contribution

It reintroduces multi-way aligned data to create a complete language graph in MNMT, enhancing translation performance and scalability.

Findings

01

cMNMT achieves competitive translation quality across many language pairs.

02

Multi-way aligned data improves transfer learning and ease of adding new languages.

03

Model scales effectively to over 12,000 language pairs.

Abstract

Multilingual Neural Machine Translation (MNMT) models are commonly trained on a joint set of bilingual corpora which is acutely English-centric (i.e. English either as the source or target language). While direct data between two languages that are non-English is explicitly available at times, its use is not common. In this paper, we first take a step back and look at the commonly used bilingual corpora (WMT), and resurface the existence and importance of implicit structure that existed in it: multi-way alignment across examples (the same sentence in more than two languages). We set out to study the use of multi-way aligned examples to enrich the original English-centric parallel corpora. We reintroduce this direct parallel data from multi-way aligned corpora between all source and target languages. By doing so, the English-centric graph expands into a complete graph, every language…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.