VaSAB: The variable size adaptive information bottleneck for disentanglement on speech and singing voice
Frederik Bous, Axel Roebel

TL;DR
This paper introduces VaSAB, a variable size adaptive information bottleneck using dropout for voice disentanglement, improving synthesis quality and enabling a universal model for speech and singing voices.
Contribution
It proposes a novel adaptive bottleneck method with dropout, allowing dynamic adjustment of the bottleneck size for better disentanglement and synthesis in voice transformation.
Findings
Improved disentanglement of F0 parameter in speech and singing voice.
Achieved high-pitch disentanglement in singing voice.
Created a universal voice model for speech and singing.
Abstract
The information bottleneck auto-encoder is a tool for disentanglement commonly used for voice transformation. The successful disentanglement relies on the right choice of bottleneck size. Previous bottleneck auto-encoders created the bottleneck by the dimension of the latent space or through vector quantization and had no means to change the bottleneck size of a specific model. As the bottleneck removes information from the disentangled representation, the choice of bottleneck size is a trade-off between disentanglement and synthesis quality. We propose to build the information bottleneck using dropout which allows us to change the bottleneck through the dropout rate and investigate adapting the bottleneck size depending on the context. We experimentally explore into using the adaptive bottleneck for pitch transformation and demonstrate that the adaptive bottleneck leads to improved…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Music and Audio Processing · Speech and Audio Processing
MethodsDropout
