Neural Network-based Word Alignment through Score Aggregation

Joel Legrand; Michael Auli; Ronan Collobert

arXiv:1606.09560·cs.CL·July 1, 2016


TL;DR

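This paper introduces a neural network model for word alignment that is trained without alignment annotations, using score aggregation and a soft-margin objective. Compared to Fast Align, it improves alignment accuracy by 1.7 to 7 AER on three language pairs (English-Czech, Romanian-English, English-French).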

Contribution

The paper proposes a simple neural network that scores pairs of source and target word windows and aggregates those scores per target word, which makes unsupervised training possible. The resulting aligner outperforms the popular Fast Align model.

Findings

01 7 AER improvement over Fast Align on English-Czech

02 6 AER improvement over Fast Align on Romanian-English

03 1.7 AER improvement over Fast Align on English-French
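
For readers unfamiliar with the metric: AER (alignment error rate) is lower for better alignments, so an "improvement" here is a reduction. The sketch below follows the commonly used definition with sure links S and possible links P, AER = 1 - (|A & S| + |A & P|) / (|A| + |S|). The function name and the toy links are illustrative assumptions, not taken from the paper, and the reported figures are presumably on a 0-100 scale.

```python
def alignment_error_rate(predicted, sure, possible):
    """Standard AER: 1 - (|A & S| + |A & P|) / (|A| + |S|).

    Each argument is an iterable of (source_index, target_index) links.
    Sure links are treated as a subset of the possible links.
    """
    a = set(predicted)
    s = set(sure)
    p = set(possible) | s  # every sure link also counts as possible
    if not a and not s:
        return 0.0  # nothing predicted and nothing expected
    return 1.0 - (len(a & s) + len(a & p)) / (len(a) + len(s))


# Toy example (hypothetical links, for illustration only).
predicted = {(0, 0), (1, 1), (2, 3)}
sure = {(0, 0), (1, 1), (2, 2)}
possible = {(2, 3)}
aer = alignment_error_rate(predicted, sure, possible)
print(f"AER = {100 * aer:.1f}")
```

Multiplying by 100 gives the point scale on which differences such as "7 AER" are usually quoted.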

Abstract

We present a simple neural network for word alignment that builds source and target word window representations to compute alignment scores for sentence pairs. To enable unsupervised training, we use an aggregation operation that summarizes the alignment scores for a given target word. A soft-margin objective increases scores for true target words while decreasing scores for target words that are not present. Compared to the popular Fast Align model, our approach improves alignment accuracy by 7 AER on English-Czech, by 6 AER on Romanian-English and by 1.7 AER on English-French alignment.
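
The training idea in the abstract can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the authors' implementation: the window representations are stand-in random vectors, alignment scores are taken to be dot products, LogSumExp is assumed as the aggregation operation, and the soft-margin loss is written as log(1 + exp(-y * s)) with y = +1 for words present in the target sentence and y = -1 for sampled absent words. All function names are hypothetical.

```python
import numpy as np


def logsumexp(x, axis):
    """Numerically stable log(sum(exp(x))) along one axis."""
    m = np.max(x, axis=axis, keepdims=True)
    return np.squeeze(m, axis=axis) + np.log(np.sum(np.exp(x - m), axis=axis))


def aggregate_scores(src_repr, tgt_repr):
    """Score every (source, target) pair, then summarize per target word.

    src_repr: (n_src, dim) source window representations (assumed given).
    tgt_repr: (n_tgt, dim) target window representations (assumed given).
    Returns the (n_src, n_tgt) score matrix and one aggregated score
    per target word.
    """
    scores = src_repr @ tgt_repr.T           # assumption: dot-product scores
    aggregated = logsumexp(scores, axis=0)   # assumption: LogSumExp aggregation
    return scores, aggregated


def soft_margin_loss(aggregated, labels):
    """Sum of log(1 + exp(-y * s)); labels are +1 (present) or -1 (absent)."""
    return float(np.sum(np.logaddexp(0.0, -labels * aggregated)))


# Toy usage with random stand-in representations.
rng = np.random.default_rng(0)
src = rng.normal(size=(4, 8))         # 4 source positions
tgt_present = rng.normal(size=(5, 8)) # 5 words in the target sentence
tgt_absent = rng.normal(size=(3, 8))  # 3 sampled words not in the sentence

_, pos = aggregate_scores(src, tgt_present)
_, neg = aggregate_scores(src, tgt_absent)
loss = soft_margin_loss(pos, np.ones_like(pos)) + soft_margin_loss(neg, -np.ones_like(neg))
print(loss)
```

Because the loss only needs to know which target words occur in the sentence, no word-level alignment labels are required. Minimizing it pushes aggregated scores up for present words and down for absent ones.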
