NeuralDPS: Neural Deterministic Plus Stochastic Model with Multiband Excitation for Noise-Controllable Waveform Generation
Tao Wang, Ruibo Fu, Jiangyan Yi, Jianhua Tao, Zhengqi Wen

TL;DR
NeuralDPS is a neural vocoder that combines deterministic and stochastic modeling with multiband excitation to produce high-quality, noise-controllable speech efficiently, outperforming traditional neural vocoders in speed and noise management.
Contribution
This paper introduces NeuralDPS, a novel neural vocoder that integrates deterministic and stochastic modules with multiband excitation for improved speech quality, efficiency, and noise control.
Findings
Achieves similar speech quality to WaveNet.
Generates waveforms at least 280 times faster than WaveNet.
Effectively controls noise components and adjusts SNR.
Abstract
The traditional vocoders have the advantages of high synthesis efficiency, strong interpretability, and speech editability, while the neural vocoders have the advantage of high synthesis quality. To combine the advantages of two vocoders, inspired by the traditional deterministic plus stochastic model, this paper proposes a novel neural vocoder named NeuralDPS which can retain high speech quality and acquire high synthesis efficiency and noise controllability. Firstly, this framework contains four modules: a deterministic source module, a stochastic source module, a neural V/UV decision module and a neural filter module. The input required by the vocoder is just the spectral parameter, which avoids the error caused by estimating additional parameters, such as F0. Secondly, to solve the problem that different frequency bands may have different proportions of deterministic components and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDilated Causal Convolution · Mixture of Logistic Distributions · WaveNet
