Parallel and Limited Data Voice Conversion Using Stochastic Variational   Deep Kernel Learning

Mohamadreza Jafaryani; Hamid Sheikhzadeh; Vahid Pourahmadi

arXiv:2309.04420·cs.SD·September 11, 2023

Parallel and Limited Data Voice Conversion Using Stochastic Variational Deep Kernel Learning

Mohamadreza Jafaryani, Hamid Sheikhzadeh, Vahid Pourahmadi

PDF

TL;DR

This paper introduces a voice conversion method that effectively works with limited data by combining stochastic variational deep kernel learning with neural networks, achieving high-quality results with only about 80 seconds of training data.

Contribution

The proposed approach integrates deep neural networks with Gaussian processes using SVDKL, enabling effective voice conversion with limited data and addressing scalability and overfitting issues.

Findings

01

Higher mean opinion score than compared methods

02

Smaller spectral distortion achieved

03

Better preference test results

Abstract

Typically, voice conversion is regarded as an engineering problem with limited training data. The reliance on massive amounts of data hinders the practical applicability of deep learning approaches, which have been extensively researched in recent years. On the other hand, statistical methods are effective with limited data but have difficulties in modelling complex mapping functions. This paper proposes a voice conversion method that works with limited data and is based on stochastic variational deep kernel learning (SVDKL). At the same time, SVDKL enables the use of deep neural networks' expressive capability as well as the high flexibility of the Gaussian process as a Bayesian and non-parametric method. When the conventional kernel is combined with the deep neural network, it is possible to estimate non-smooth and more complex functions. Furthermore, the model's sparse variational…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsGaussian Process