Learning Stochastic Recurrent Networks
Justin Bayer, Christian Osendorfer

TL;DR
This paper introduces Stochastic Recurrent Networks (STORNs), which incorporate latent variables into RNNs using variational inference, enabling structured probabilistic modeling and improved training for sequential data.
Contribution
The paper presents a novel stochastic RNN model that can be trained efficiently with stochastic gradients and supports complex conditional distributions.
Findings
Effective on polyphonic music datasets
Outperforms deterministic RNNs in modeling complex sequences
Provides reliable marginal likelihood estimation
Abstract
Leveraging advances in variational inference, we propose to enhance recurrent neural networks with latent variables, resulting in Stochastic Recurrent Networks (STORNs). The model i) can be trained with stochastic gradient methods, ii) allows structured and multi-modal conditionals at each time step, iii) features a reliable estimator of the marginal likelihood and iv) is a generalisation of deterministic recurrent neural networks. We evaluate the method on four polyphonic musical data sets and motion capture data.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Neural Networks and Applications · Bayesian Modeling and Causal Inference
