A Study of Reinforcement Learning for Neural Machine Translation

Lijun Wu; Fei Tian; Tao Qin; Jianhuang Lai; Tie-Yan Liu

arXiv:1808.08866·cs.LG·August 28, 2018·19 cites

A Study of Reinforcement Learning for Neural Machine Translation

Lijun Wu, Fei Tian, Tao Qin, Jianhuang Lai, Tie-Yan Liu

PDF

Open Access 1 Repo

TL;DR

This paper systematically studies reinforcement learning for neural machine translation, addressing training challenges, comparing key factors, and proposing a new method to enhance performance using monolingual data, achieving state-of-the-art results.

Contribution

It provides a comprehensive analysis of RL training for NMT, introduces a novel approach leveraging monolingual data, and demonstrates improved translation performance on multiple benchmarks.

Findings

01

RL training stability varies with baseline reward and reward shaping.

02

The proposed method effectively utilizes monolingual data to boost NMT performance.

03

Achieved state-of-the-art results on WMT17 Chinese-English translation.

Abstract

Recent studies have shown that reinforcement learning (RL) is an effective approach for improving the performance of neural machine translation (NMT) system. However, due to its instability, successfully RL training is challenging, especially in real-world systems where deep models and large datasets are leveraged. In this paper, taking several large-scale translation tasks as testbeds, we conduct a systematic study on how to train better NMT models using reinforcement learning. We provide a comprehensive comparison of several important factors (e.g., baseline reward, reward shaping) in RL training. Furthermore, to fill in the gap that it remains unclear whether RL is still beneficial when monolingual data is used, we propose a new method to leverage RL to further boost the performance of NMT systems trained with source/target monolingual data. By integrating all our findings, we obtain…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

apeterswu/RL4NMT
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications