Incorporating Structural Alignment Biases into an Attentional Neural   Translation Model

Trevor Cohn; Cong Duy Vu Hoang; Ekaterina Vymolova; Kaisheng; Yao; Chris Dyer; Gholamreza Haffari

arXiv:1601.01085·cs.CL·January 7, 2016

Incorporating Structural Alignment Biases into an Attentional Neural Translation Model

Trevor Cohn, Cong Duy Vu Hoang, Ekaterina Vymolova, Kaisheng, Yao, Chris Dyer, Gholamreza Haffari

PDF

TL;DR

This paper enhances neural machine translation models by integrating structural biases from traditional alignment models, leading to improved performance especially in low-resource language pairs.

Contribution

It introduces a method to incorporate structural alignment biases into attentional neural translation models, improving translation quality.

Findings

01

Improved translation accuracy over baseline models

02

Effective in low-resource language settings

03

Outperforms standard phrase-based models

Abstract

Neural encoder-decoder models of machine translation have achieved impressive results, rivalling traditional translation models. However their modelling formulation is overly simplistic, and omits several key inductive biases built into traditional models. In this paper we extend the attentional neural translation model to include structural biases from word based alignment models, including positional bias, Markov conditioning, fertility and agreement over translation directions. We show improvements over a baseline attentional model and standard phrase-based model over several language pairs, evaluating on difficult languages in a low resource setting.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.