Supervised Attentions for Neural Machine Translation

Haitao Mi; Zhiguo Wang; Abe Ittycheriah

arXiv:1608.00112·cs.CL·August 2, 2016·29 cites

Supervised Attentions for Neural Machine Translation

Haitao Mi, Zhiguo Wang, Abe Ittycheriah

PDF

Open Access

TL;DR

This paper introduces a supervised attention mechanism in neural machine translation that leverages true alignments during training, significantly enhancing translation and alignment quality over existing models.

Contribution

It proposes a novel method to incorporate true alignment information into neural machine translation training, improving accuracy and outperforming traditional syntax-based systems.

Findings

01

Improved translation quality on Chinese-English tasks

02

Enhanced alignment accuracy with supervised attention

03

Outperforms state-of-the-art syntax-based systems

Abstract

In this paper, we improve the attention or alignment accuracy of neural machine translation by utilizing the alignments of training sentence pairs. We simply compute the distance between the machine attentions and the "true" alignments, and minimize this cost in the training procedure. Our experiments on large-scale Chinese-to-English task show that our model improves both translation and alignment qualities significantly over the large-vocabulary neural machine translation system, and even beats a state-of-the-art traditional syntax-based system.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Semantic Web and Ontologies