Recurrent Dropout without Memory Loss

Stanislau Semeniuta; Aliaksei Severyn; Erhardt Barth

arXiv:1603.05118·cs.CL·August 8, 2016·100 cites

Recurrent Dropout without Memory Loss

Stanislau Semeniuta, Aliaksei Severyn, Erhardt Barth

PDF

Open Access 2 Repos

TL;DR

This paper introduces a novel recurrent dropout method that drops neurons in recurrent connections without losing long-term memory, improving RNN performance on NLP tasks.

Contribution

It proposes a simple, effective recurrent dropout technique that preserves memory, enhancing RNN regularization without complex modifications.

Findings

01

Consistent performance improvements on NLP benchmarks

02

Effective when combined with standard dropout methods

03

Applicable to LSTM networks with easy implementation

Abstract

This paper presents a novel approach to recurrent neural network (RNN) regularization. Differently from the widely adopted dropout method, which is applied to \textit{forward} connections of feed-forward architectures or RNNs, we propose to drop neurons directly in \textit{recurrent} connections in a way that does not cause loss of long-term memory. Our approach is as easy to implement and apply as the regular feed-forward dropout and we demonstrate its effectiveness for Long Short-Term Memory network, the most popular type of RNN cells. Our experiments on NLP benchmarks show consistent improvements even when combined with conventional feed-forward dropout.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Neural Networks and Applications · Domain Adaptation and Few-Shot Learning

MethodsRecurrent Dropout · Dropout