Neural Machine Translation with Supervised Attention
Lemao Liu, Masao Utiyama, Andrew Finch, Eiichiro Sumita

TL;DR
This paper introduces a supervised attention mechanism for neural machine translation that leverages conventional alignment models to improve alignment accuracy and translation quality.
Contribution
It proposes a novel supervised attention approach guided by traditional alignment models, enhancing NMT performance.
Findings
Supervised attention improves alignment accuracy.
Enhanced alignments lead to better translation quality.
Significant gains over standard attention-based NMT.
Abstract
The attention mechanisim is appealing for neural machine translation, since it is able to dynam- ically encode a source sentence by generating a alignment between a target word and source words. Unfortunately, it has been proved to be worse than conventional alignment models in aligment accuracy. In this paper, we analyze and explain this issue from the point view of re- ordering, and propose a supervised attention which is learned with guidance from conventional alignment models. Experiments on two Chinese-to-English translation tasks show that the super- vised attention mechanism yields better alignments leading to substantial gains over the standard attention based NMT.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
