Refining Source Representations with Relation Networks for Neural   Machine Translation

Wen Zhang; Jiawei Hu; Yang Feng; Qun Liu

arXiv:1709.03980·cs.CL·May 28, 2018·6 cites

Refining Source Representations with Relation Networks for Neural Machine Translation

Wen Zhang, Jiawei Hu, Yang Feng, Qun Liu

PDF

Open Access

TL;DR

This paper introduces relation networks into neural machine translation to enhance source representations by modeling word relationships, leading to significant improvements over baseline models.

Contribution

It proposes a novel integration of relation networks into NMT to refine source encoding without altering the core encoder-decoder architecture.

Findings

01

Outperforms baseline models on Chinese-English translation tasks

02

Enhances source representations by modeling word relations

03

Achieves significant translation quality improvements

Abstract

Although neural machine translation (NMT) with the encoder-decoder framework has achieved great success in recent times, it still suffers from some drawbacks: RNNs tend to forget old information which is often useful and the encoder only operates through words without considering word relationship. To solve these problems, we introduce a relation networks (RN) into NMT to refine the encoding representations of the source. In our method, the RN first augments the representation of each source word with its neighbors and reasons all the possible pairwise relations between them. Then the source representations and all the relations are fed to the attention module and the decoder together, keeping the main encoder-decoder architecture unchanged. Experiments on two Chinese-to-English data sets in different scales both show that our method can outperform the competitive baselines…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications