Learning Stochastic Recurrent Networks

Justin Bayer; Christian Osendorfer

arXiv:1411.7610·stat.ML·March 9, 2015·196 cites

Learning Stochastic Recurrent Networks

Justin Bayer, Christian Osendorfer

PDF

Open Access 1 Repo

TL;DR

This paper introduces Stochastic Recurrent Networks (STORNs), which incorporate latent variables into RNNs using variational inference, enabling structured probabilistic modeling and improved training for sequential data.

Contribution

The paper presents a novel stochastic RNN model that can be trained efficiently with stochastic gradients and supports complex conditional distributions.

Findings

01

Effective on polyphonic music datasets

02

Outperforms deterministic RNNs in modeling complex sequences

03

Provides reliable marginal likelihood estimation

Abstract

Leveraging advances in variational inference, we propose to enhance recurrent neural networks with latent variables, resulting in Stochastic Recurrent Networks (STORNs). The model i) can be trained with stochastic gradient methods, ii) allows structured and multi-modal conditionals at each time step, iii) features a reliable estimator of the marginal likelihood and iv) is a generalisation of deterministic recurrent neural networks. We evaluate the method on four polyphonic musical data sets and motion capture data.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dgedon/DeepSSM_SysID
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Neural Networks and Applications · Bayesian Modeling and Causal Inference