Syntax-Directed Attention for Neural Machine Translation

Kehai Chen; Rui Wang; Masao Utiyama; Eiichiro Sumita; and Tiejun Zhao

arXiv:1711.04231·cs.CL·September 20, 2019·58 cites

Syntax-Directed Attention for Neural Machine Translation

Kehai Chen, Rui Wang, Masao Utiyama, Eiichiro Sumita, and Tiejun Zhao

PDF

Open Access

TL;DR

This paper introduces syntax-directed attention and a double context architecture in neural machine translation, significantly improving translation quality by focusing on syntactically relevant source words.

Contribution

It extends local attention with syntax-distance constraints and proposes a double context NMT model, enhancing translation accuracy over baseline systems.

Findings

01

Significant improvement in Chinese-English translation accuracy.

02

Enhanced focus on syntactically related source words.

03

Outperforms baseline models on large-scale translation tasks.

Abstract

Attention mechanism, including global attention and local attention, plays a key role in neural machine translation (NMT). Global attention attends to all source words for word prediction. In comparison, local attention selectively looks at fixed-window source words. However, alignment weights for the current target word often decrease to the left and right by linear distance centering on the aligned source position and neglect syntax-directed distance constraints. In this paper, we extend local attention with syntax-distance constraint, to focus on syntactically related source words with the predicted target word, thus learning a more effective context vector for word prediction. Moreover, we further propose a double context NMT architecture, which consists of a global context vector and a syntax-directed context vector over the global attention, to provide more translation performance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification