A Shared Attention Mechanism for Interpretation of Neural Automatic   Post-Editing Systems

Inigo Jauregi Unanue; Ehsan Zare Borzeshi; Massimo Piccardi

arXiv:1807.00248·cs.CL·July 3, 2018

A Shared Attention Mechanism for Interpretation of Neural Automatic Post-Editing Systems

Inigo Jauregi Unanue, Ehsan Zare Borzeshi, Massimo Piccardi

PDF

1 Repo

TL;DR

This paper introduces a neural automatic post-editing system with a shared attention mechanism that improves interpretability and effectively utilizes source and machine translation inputs to correct errors.

Contribution

It proposes a novel shared attention mechanism for neural APE systems, enhancing interpretability and leveraging both source and translation inputs more effectively.

Findings

01

Attention shifts toward source tokens when MT is incorrect.

02

System achieves comparable accuracy to existing models.

03

Provides better interpretability of the post-editing process.

Abstract

Automatic post-editing (APE) systems aim to correct the systematic errors made by machine translators. In this paper, we propose a neural APE system that encodes the source (src) and machine translated (mt) sentences with two separate encoders, but leverages a shared attention mechanism to better understand how the two inputs contribute to the generation of the post-edited (pe) sentences. Our empirical observations have showed that when the mt is incorrect, the attention shifts weight toward tokens in the src sentence to properly edit the incorrect translation. The model has been trained and evaluated on the official data from the WMT16 and WMT17 APE IT domain English-German shared tasks. Additionally, we have used the extra 500K artificial data provided by the shared task. Our system has been able to reproduce the accuracies of systems trained with the same data, while at the same time…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ijauregiCMCRC/Shared_Attention_for_APE
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.