Information-Propogation-Enhanced Neural Machine Translation by Relation   Model

Wen Zhang; Jiawei Hu; Yang Feng; Qun Liu

arXiv:1709.01766·cs.CL·May 28, 2018

Information-Propogation-Enhanced Neural Machine Translation by Relation Model

Wen Zhang, Jiawei Hu, Yang Feng, Qun Liu

PDF

Open Access

TL;DR

This paper introduces a relation network into neural machine translation to improve long-distance information flow, significantly enhancing translation quality over existing models.

Contribution

It proposes a novel relation network-enhanced NMT framework that better captures long-term dependencies, addressing limitations of RNNs and CNNs in sequence modeling.

Findings

01

Outperforms statistical MT models on two datasets.

02

Surpasses state-of-the-art NMT models in translation quality.

03

Enhances information propagation in neural networks.

Abstract

Even though sequence-to-sequence neural machine translation (NMT) model have achieved state-of-art performance in the recent fewer years, but it is widely concerned that the recurrent neural network (RNN) units are very hard to capture the long-distance state information, which means RNN can hardly find the feature with long term dependency as the sequence becomes longer. Similarly, convolutional neural network (CNN) is introduced into NMT for speeding recently, however, CNN focus on capturing the local feature of the sequence; To relieve this issue, we incorporate a relation network into the standard encoder-decoder framework to enhance information-propogation in neural network, ensuring that the information of the source sentence can flow into the decoder adequately. Experiments show that proposed framework outperforms the statistical MT model and the state-of-art NMT model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques