Neural Machine Translation with Supervised Attention

Lemao Liu; Masao Utiyama; Andrew Finch; Eiichiro Sumita

arXiv:1609.04186·cs.CL·September 15, 2016·27 cites

Neural Machine Translation with Supervised Attention

Lemao Liu, Masao Utiyama, Andrew Finch, Eiichiro Sumita

PDF

Open Access

TL;DR

This paper introduces a supervised attention mechanism for neural machine translation that leverages conventional alignment models to improve alignment accuracy and translation quality.

Contribution

It proposes a novel supervised attention approach guided by traditional alignment models, enhancing NMT performance.

Findings

01

Supervised attention improves alignment accuracy.

02

Enhanced alignments lead to better translation quality.

03

Significant gains over standard attention-based NMT.

Abstract

The attention mechanisim is appealing for neural machine translation, since it is able to dynam- ically encode a source sentence by generating a alignment between a target word and source words. Unfortunately, it has been proved to be worse than conventional alignment models in aligment accuracy. In this paper, we analyze and explain this issue from the point view of re- ordering, and propose a supervised attention which is learned with guidance from conventional alignment models. Experiments on two Chinese-to-English translation tasks show that the super- vised attention mechanism yields better alignments leading to substantial gains over the standard attention based NMT.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification