Dynamic Fusion: Attentional Language Model for Neural Machine Translation

Michiki Kurosawa; Mamoru Komachi

arXiv:1909.04879 · cs.CL · September 12, 2019

TL;DR

This paper introduces Dynamic Fusion, an attentive mechanism that effectively integrates language models into neural machine translation, improving translation quality by dynamically considering translation history and grammatical structure.

Contribution

The work proposes a novel attentive fusion approach that adaptively combines language and translation models, addressing limitations of previous static weighting methods.

Findings

1. Improved BLEU and RIBES scores in English-Japanese translation
2. Enhanced grammatical conformity in language modeling
3. Dynamic fusion outperforms previous integration methods

Abstract

Neural Machine Translation (NMT) can be used to generate fluent output. As such, language models have been investigated for incorporation with NMT. In prior investigations, two models have been used: a translation model and a language model. The translation model's predictions are weighted by the language model with a hand-crafted ratio in advance. However, these approaches fail to adapt the language model weighting with regard to the translation history. In another line of approach, language model prediction is incorporated into the translation model by jointly considering source and target information. However, this line of approach is limited because it largely ignores the adequacy of the translation output. Accordingly, this work employs two mechanisms, the translation model and the language model, with an attentive architecture to the language model as an auxiliary element of the…
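
To make the static weighting mentioned in the abstract concrete, here is a minimal sketch of combining a translation model and a language model with a fixed, hand-set ratio at a single decoding step. It illustrates the prior approach the abstract criticizes, not the paper's Dynamic Fusion architecture. The function names, the toy vocabulary, the scores, and the weight value are all assumptions made up for illustration.

```python
import math


def log_softmax(scores):
    """Turn raw scores into log-probabilities (numerically stable)."""
    m = max(scores)
    log_z = m + math.log(sum(math.exp(s - m) for s in scores))
    return [s - log_z for s in scores]


def static_fusion(tm_scores, lm_scores, lm_weight):
    """Add language-model log-probabilities to translation-model
    log-probabilities using one fixed weight (hypothetical helper).

    lm_weight is chosen in advance and is the same at every decoding
    step, so it cannot react to the translation history.
    """
    tm_logp = log_softmax(tm_scores)
    lm_logp = log_softmax(lm_scores)
    return [t + lm_weight * l for t, l in zip(tm_logp, lm_logp)]


def argmax(values):
    """Index of the largest value."""
    return max(range(len(values)), key=lambda i: values[i])


# Toy example (made-up numbers): the translation model slightly prefers
# "cat", while the language model strongly prefers "cats" in this context.
vocab = ["cat", "cats", "dog"]
tm_scores = [2.0, 1.9, 0.1]
lm_scores = [0.0, 2.0, 0.0]

without_lm = vocab[argmax(static_fusion(tm_scores, lm_scores, 0.0))]
with_lm = vocab[argmax(static_fusion(tm_scores, lm_scores, 0.5))]
```

In this toy setup the fixed weight flips the chosen word from the translation model's preference to the language model's. The same weight applies to every step regardless of context, which is the limitation the abstract attributes to hand-crafted ratios.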

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications