Implicit Distortion and Fertility Models for Attention-based   Encoder-Decoder NMT Model

Shi Feng; Shujie Liu; Mu Li; Ming Zhou

arXiv:1601.03317·cs.CL·January 25, 2016·39 cites

Implicit Distortion and Fertility Models for Attention-based Encoder-Decoder NMT Model

Shi Feng, Shujie Liu, Mu Li, Ming Zhou

PDF

Open Access

TL;DR

This paper introduces new attention-based encoder-decoder models with implicit distortion and fertility mechanisms to improve translation quality in neural machine translation, addressing alignment issues.

Contribution

The paper proposes novel variations of attention mechanisms incorporating implicit distortion and fertility models, enhancing translation accuracy over standard models.

Findings

01

Achieved a 2 BLEU point improvement over original models.

02

Demonstrated that addressing alignment issues improves translation quality.

03

Compared new models with existing approaches on machine translation tasks.

Abstract

Neural machine translation has shown very promising results lately. Most NMT models follow the encoder-decoder framework. To make encoder-decoder models more flexible, attention mechanism was introduced to machine translation and also other tasks like speech recognition and image captioning. We observe that the quality of translation by attention-based encoder-decoder can be significantly damaged when the alignment is incorrect. We attribute these problems to the lack of distortion and fertility models. Aiming to resolve these problems, we propose new variations of attention-based encoder-decoder and compare them with other models on machine translation. Our proposed method achieved an improvement of 2 BLEU points over the original attention-based encoder-decoder.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Natural Language Processing Techniques · Topic Modeling