Continuous Wavelet Vocoder-based Decomposition of Parametric Speech   Waveform Synthesis

Mohammed Salah Al-Radhi; Tam\'as G\'abor Csap\'o; Csaba Zaink\'o,; G\'eza N\'emeth

arXiv:2106.06863·cs.SD·June 15, 2021

Continuous Wavelet Vocoder-based Decomposition of Parametric Speech Waveform Synthesis

Mohammed Salah Al-Radhi, Tam\'as G\'abor Csap\'o, Csaba Zaink\'o,, G\'eza N\'emeth

PDF

TL;DR

This paper discusses the use of continuous wavelet vocoders for decomposing parametric speech waveforms, aiming to improve speech synthesis quality and efficiency.

Contribution

It introduces a novel vocoder-based decomposition method utilizing continuous wavelet transforms for parametric speech waveform synthesis.

Findings

01

Enhanced speech quality with wavelet-based decomposition

02

Reduced computational complexity compared to neural network models

03

Potential for real-time speech synthesis applications

Abstract

To date, various speech technology systems have adopted the vocoder approach, a method for synthesizing speech waveform that shows a major role in the performance of statistical parametric speech synthesis. WaveNet one of the best models that nearly resembles the human voice, has to generate a waveform in a time consuming sequential manner with an extremely complex structure of its neural networks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDilated Causal Convolution · Mixture of Logistic Distributions · WaveNet