Input-to-Output Gate to Improve RNN Language Models

Sho Takase; Jun Suzuki; Masaaki Nagata

arXiv:1709.08907·cs.CL·September 29, 2017

Input-to-Output Gate to Improve RNN Language Models

Sho Takase, Jun Suzuki, Masaaki Nagata

PDF

Open Access 1 Repo

TL;DR

This paper introduces the Input-to-Output Gate (IOG), a simple reinforcement method that enhances RNN language models' performance by refining their output layers, demonstrated on Penn Treebank and WikiText-2 datasets.

Contribution

The paper presents a novel, simple gating mechanism called IOG that can be integrated with existing RNN language models to improve their performance.

Findings

01

IOG consistently improves RNN language model performance.

02

Effective across different RNN architectures.

03

Demonstrated on Penn Treebank and WikiText-2 datasets.

Abstract

This paper proposes a reinforcing method that refines the output layers of existing Recurrent Neural Network (RNN) language models. We refer to our proposed method as Input-to-Output Gate (IOG). IOG has an extremely simple structure, and thus, can be easily combined with any RNN language models. Our experiments on the Penn Treebank and WikiText-2 datasets demonstrate that IOG consistently boosts the performance of several different types of current topline RNN language models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nttcslab-nlp/iog
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications