Stochastic WaveNet: A Generative Latent Variable Model for Sequential Data
Guokun Lai, Bohan Li, Guoqing Zheng, Yiming Yang

TL;DR
Stochastic WaveNet combines stochastic latent variables with dilated convolutions to enhance sequential data modeling, achieving state-of-the-art results in speech and handwriting generation.
Contribution
It introduces a novel architecture that integrates stochastic latent variables into WaveNet, improving distribution modeling and training efficiency for sequential data.
Findings
Achieves state-of-the-art performance on speech datasets.
Generates high-quality human handwriting samples.
Demonstrates improved modeling capacity over existing methods.
Abstract
How to model distribution of sequential data, including but not limited to speech and human motions, is an important ongoing research problem. It has been demonstrated that model capacity can be significantly enhanced by introducing stochastic latent variables in the hidden states of recurrent neural networks. Simultaneously, WaveNet, equipped with dilated convolutions, achieves astonishing empirical performance in natural speech generation task. In this paper, we combine the ideas from both stochastic latent variables and dilated convolutions, and propose a new architecture to model sequential data, termed as Stochastic WaveNet, where stochastic latent variables are injected into the WaveNet structure. We argue that Stochastic WaveNet enjoys powerful distribution modeling capacity and the advantage of parallel training from dilated convolutions. In order to efficiently infer the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Music and Audio Processing · Anomaly Detection Techniques and Applications
MethodsMixture of Logistic Distributions · Dilated Causal Convolution · WaveNet
