Universal Vector Neural Machine Translation With Effective Attention

Satish Mylapore, Ryan Quincy Paul, Joshua Yi, and Robert D. Slater

arXiv:2006.05003 · cs.CL · June 11, 2020

TL;DR

This paper introduces a universal neural machine translation model that can handle multiple languages with a single encoder-decoder architecture, reducing the need for multiple models and improving translation flexibility.

Contribution

The paper presents a novel universal NMT model with an integrated attention mechanism that supports multiple languages within one model, streamlining multilingual translation tasks.

Findings

01. Reduces the number of models needed for multilingual translation

02. Increases translation accuracy for long sentences

03. Supports multiple language pairs with a single model

Abstract

Neural Machine Translation (NMT) uses one or more trained neural networks to translate phrases. Sutskever et al. introduced a sequence-to-sequence encoder-decoder model that became the standard for NMT systems. Attention mechanisms were later introduced to address problems with translating long sentences and to improve overall accuracy. In this paper, we propose a single model for Neural Machine Translation based on encoder-decoder models. Most translation models are trained with one model for each translation. First, we introduce a neutral/universal model representation that can be used to predict more than one language, depending on the source and a provided target. Second, we introduce an attention model that adds an overall learning vector to the multiplicative model. With these two changes, by using the novel universal model the number of models needed for…
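
The abstract mentions adding a learned vector to multiplicative attention but does not give the formula here. The sketch below is one plausible reading, not the authors' formulation: it assumes the standard multiplicative score (decoder state times a weight matrix times encoder state) and adds a hypothetical term, the dot product of a learned vector `v` with each encoder state. The function name `attention_with_learned_vector`, the parameter `v`, and the additive form are all assumptions made for illustration.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def matvec(W, x):
    """Multiply matrix W (a list of rows) by vector x."""
    return [dot(row, x) for row in W]


def attention_with_learned_vector(dec_state, enc_states, W, v):
    """Multiplicative attention plus a learned-vector term (ASSUMED form).

    score_i = dec_state^T W enc_i + v^T enc_i

    The first term is ordinary multiplicative attention. The second
    term is a hypothetical stand-in for the "overall learning vector"
    named in the abstract; the paper's actual definition may differ.
    """
    scores = [dot(dec_state, matvec(W, h)) + dot(v, h) for h in enc_states]
    weights = softmax(scores)
    dim = len(enc_states[0])
    # Context vector: attention-weighted sum of the encoder states.
    context = [sum(w * h[j] for w, h in zip(weights, enc_states))
               for j in range(dim)]
    return weights, context


# Toy example: three encoder states, one decoder state, identity W.
enc_states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
dec_state = [0.5, -0.5]
W = [[1.0, 0.0], [0.0, 1.0]]

# With v set to zero this reduces to plain multiplicative attention.
plain_weights, _ = attention_with_learned_vector(
    dec_state, enc_states, W, [0.0, 0.0])

# A non-zero v shifts attention toward states that align with v.
shifted_weights, context = attention_with_learned_vector(
    dec_state, enc_states, W, [0.0, 2.0])
```

With `v` set to zero the scores are exactly the multiplicative ones, so the extra term can be read as a query-independent bias over encoder positions. Whether the paper uses this form or another would need to be checked against the full text.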
