# Graph Convolutional Encoders for Syntax-aware Neural Machine Translation

**Authors:** Jasmijn Bastings, Ivan Titov, Wilker Aziz, Diego Marcheggiani, Khalil, Sima'an

arXiv: 1704.04675 · 2020-06-22

## TL;DR

This paper introduces a graph convolutional network-based method to incorporate syntactic dependency structures into neural machine translation models, significantly improving translation quality across multiple language pairs.

## Contribution

It proposes a novel use of GCNs to embed syntactic information into encoder representations for neural machine translation, enhancing performance over syntax-agnostic models.

## Key findings

- Substantial improvements in translation quality with GCNs
- Effective integration of syntactic structures into various encoder types
- Consistent gains across English-German and English-Czech translation tasks

## Abstract

We present a simple and effective approach to incorporating syntactic structure into neural attention-based encoder-decoder models for machine translation. We rely on graph-convolutional networks (GCNs), a recent class of neural networks developed for modeling graph-structured data. Our GCNs use predicted syntactic dependency trees of source sentences to produce representations of words (i.e. hidden states of the encoder) that are sensitive to their syntactic neighborhoods. GCNs take word representations as input and produce word representations as output, so they can easily be incorporated as layers into standard encoders (e.g., on top of bidirectional RNNs or convolutional neural networks). We evaluate their effectiveness with English-German and English-Czech translation experiments for different types of encoders and observe substantial improvements over their syntax-agnostic versions in all the considered setups.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1704.04675/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/1704.04675/full.md

## References

47 references — full list in the complete paper: https://tomesphere.com/paper/1704.04675/full.md

---
Source: https://tomesphere.com/paper/1704.04675