Accelerated Reinforcement Learning for Sentence Generation by Vocabulary   Prediction

Kazuma Hashimoto; Yoshimasa Tsuruoka

arXiv:1809.01694·cs.CL·April 8, 2019·1 cites

Accelerated Reinforcement Learning for Sentence Generation by Vocabulary Prediction

Kazuma Hashimoto, Yoshimasa Tsuruoka

PDF

Open Access 1 Repo

TL;DR

This paper introduces a dynamic vocabulary prediction method to reduce action space in reinforcement learning for sentence generation, resulting in faster training, less memory use, and improved BLEU scores.

Contribution

The paper proposes a novel dynamic vocabulary prediction approach that significantly improves reinforcement learning efficiency and performance in sentence generation tasks.

Findings

01

Achieves approximately 2.7x faster reinforcement learning

02

Uses about 2.3x less GPU memory

03

Attains equal or better BLEU scores with faster decoding

Abstract

A major obstacle in reinforcement learning-based sentence generation is the large action space whose size is equal to the vocabulary size of the target-side language. To improve the efficiency of reinforcement learning, we present a novel approach for reducing the action space based on dynamic vocabulary prediction. Our method first predicts a fixed-size small vocabulary for each input to generate its target sentence. The input-specific vocabularies are then used at supervised and reinforcement learning steps, and also at test time. In our experiments on six machine translation and two image captioning datasets, our method achieves faster reinforcement learning ( $\sim$ 2.7x faster) with less GPU memory ( $\sim$ 2.3x less) than the full-vocabulary counterpart. The reinforcement learning with our method consistently leads to significant improvement of BLEU scores, and the scores are equal to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hassyGo/NLG-RL
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques