How to Construct Deep Recurrent Neural Networks

Razvan Pascanu; Caglar Gulcehre; Kyunghyun Cho; Yoshua Bengio

arXiv:1312.6026·cs.NE·April 25, 2014·ICLR·249 cites

How to Construct Deep Recurrent Neural Networks

Razvan Pascanu, Caglar Gulcehre, Kyunghyun Cho, Yoshua Bengio

PDF

Open Access

TL;DR

This paper investigates how to deepen recurrent neural networks by analyzing their architecture, proposing novel deep RNN structures, and demonstrating their improved performance on music prediction and language modeling tasks.

Contribution

The paper introduces two new deep RNN architectures, distinct from stacking layers, with an innovative interpretation using neural operators.

Findings

01

Deep RNNs outperform shallow RNNs in experiments.

02

Proposed architectures benefit from increased depth.

03

Analysis clarifies the concept of depth in RNNs.

Abstract

In this paper, we explore different ways to extend a recurrent neural network (RNN) to a \textit{deep} RNN. We start by arguing that the concept of depth in an RNN is not as clear as it is in feedforward neural networks. By carefully analyzing and understanding the architecture of an RNN, however, we find three points of an RNN which may be made deeper; (1) input-to-hidden function, (2) hidden-to-hidden transition and (3) hidden-to-output function. Based on this observation, we propose two novel architectures of a deep RNN which are orthogonal to an earlier attempt of stacking multiple recurrent layers to build a deep RNN (Schmidhuber, 1992; El Hihi and Bengio, 1996). We provide an alternative interpretation of these deep RNNs using a novel framework based on neural operators. The proposed deep RNNs are empirically evaluated on the tasks of polyphonic music prediction and language…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Music Technology and Sound Studies · Speech Recognition and Synthesis