Towards Reinforcement Learning for Pivot-based Neural Machine Translation with Non-autoregressive Transformer

Evgeniia Tokarchuk; Jan Rosendahl; Weiyue Wang; Pavel Petrushkov; Tomer Lancewicki; Shahram Khadivi; Hermann Ney

arXiv:2109.13097·cs.CL·September 28, 2021·1 cites

Open Access

TL;DR

This paper introduces a reinforcement learning approach to train a non-autoregressive transformer for pivot-based neural machine translation, enabling end-to-end source-target translation in low-resource settings.

Contribution

It proposes an integrated RL-based training method for pivot-based NMT using a non-autoregressive transformer, connecting sub-tasks into a unified model.

Findings

01. Improved translation quality in low-resource language pairs.

02. End-to-end training enhances source-target translation performance.

03. Demonstrates effectiveness of RL in pivot-based NMT.

Abstract

Pivot-based neural machine translation (NMT) is commonly used in low-resource setups, especially for translation between non-English language pairs. It benefits from high-resource source-pivot and pivot-target language pairs, and a separate system is trained for each of the two sub-tasks. However, these models have no connection during training, and the source-pivot model is not optimized to produce the best translation for the source-target task. In this work, we propose to train a pivot-based NMT system with reinforcement learning (RL), which has been investigated for various text generation tasks, including machine translation (MT). We utilize a non-autoregressive transformer and present an end-to-end pivot-based integrated model, enabling training on source-target data.
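
The idea of optimizing the source-pivot model for the final source-target task can be illustrated with a toy policy-gradient sketch. This is not the paper's implementation: the abstract does not state the reward or the gradient estimator, so the sketch assumes plain REINFORCE with a 0/1 exact-match reward and a moving-average baseline, replaces the non-autoregressive transformer with a single softmax over a few pivot candidates, and treats the pivot-target model as a frozen lookup. All names (`train_pivot_policy`, `pivot_to_target`) and the example words are hypothetical.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def train_pivot_policy(pivot_to_target, reference, steps=2000, lr=0.5, seed=0):
    """Toy REINFORCE for a source->pivot policy, rewarded on the target side.

    pivot_to_target: frozen pivot->target "model" (assumption: a lookup list,
        where entry k is the target produced from pivot candidate k).
    reference: reference target for the single toy source sentence.
    Returns the learned probabilities over the pivot candidates.
    """
    rng = random.Random(seed)
    n = len(pivot_to_target)
    logits = [0.0] * n   # parameters of the source->pivot policy
    baseline = 0.0       # moving-average reward, used to reduce variance

    for _ in range(steps):
        probs = softmax(logits)
        # Sample a pivot hypothesis from the source->pivot policy.
        z = rng.choices(range(n), weights=probs)[0]
        # Score the final target produced through the frozen pivot->target step.
        reward = 1.0 if pivot_to_target[z] == reference else 0.0
        advantage = reward - baseline
        # Gradient of log softmax w.r.t. logits is one_hot(z) - probs.
        for k in range(n):
            grad_log_prob = (1.0 if k == z else 0.0) - probs[k]
            logits[k] += lr * advantage * grad_log_prob
        baseline += 0.05 * (reward - baseline)

    return softmax(logits)


if __name__ == "__main__":
    # Three hypothetical pivot candidates; only the second one leads to the reference.
    learned = train_pivot_policy(["Haus", "Heim", "Hund"], reference="Heim")
    print([round(p, 3) for p in learned])
```

Only source-target supervision (the reference) enters the update, which is the point of the sketch: the pivot choice is never supervised directly, yet the policy shifts probability toward the pivot that yields the best final translation.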

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications