Exploring the Importance of F0 Trajectories for Speaker Anonymization using X-vectors and Neural Waveform Models

Ünal Ege Gaznepoglu; Nils Peters

arXiv:2110.06887 · eess.AS · October 14, 2021 · 5 cites

TL;DR

This paper investigates the role of F0 trajectories in speaker anonymization and reports that modifying F0 before resynthesis can improve anonymization by as much as 8%, with only minor word-error rate degradation.

Contribution

It introduces and evaluates eight low-complexity F0 modification methods within a speaker anonymization framework, highlighting the importance of F0 in privacy preservation.

Findings

01

F0 modifications can improve anonymization by up to 8%.

02

F0 adjustments cause minor word-error rate degradation.

03

F0 plays a crucial role in speaker anonymization effectiveness.

Abstract

Voice conversion for speaker anonymization is an emerging field in speech processing research. Many state-of-the-art approaches resynthesize speech from the phoneme posteriorgrams (PPG) and the fundamental frequency (F0) of the input signal, together with modified X-vectors. Our research focuses on the role of F0 in speaker anonymization, which is an understudied area. Using the VoicePrivacy Challenge 2020 framework and its datasets, we developed and evaluated eight low-complexity F0 modifications applied prior to resynthesis. We found that modifying the F0 can improve speaker anonymization by as much as 8% with minor word-error rate degradation.
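
To make the idea of a low-complexity F0 modification concrete, here is a minimal illustrative sketch. The abstract does not list the eight modifications that were evaluated, so the two operations below (constant scaling, and shifting the mean of log-F0) are assumptions chosen only as typical examples and are not presented as the authors' methods. The function names, and the convention that unvoiced frames are stored as 0.0 Hz, are likewise hypothetical.

```python
import math
from typing import List


def scale_f0(f0_hz: List[float], factor: float) -> List[float]:
    """Multiply every voiced F0 value by a constant factor.

    Assumption: unvoiced frames are encoded as 0.0 and are left untouched.
    """
    if factor <= 0:
        raise ValueError("factor must be positive")
    return [f * factor if f > 0 else 0.0 for f in f0_hz]


def shift_log_f0_mean(f0_hz: List[float], target_mean_hz: float) -> List[float]:
    """Shift the trajectory so the geometric mean of voiced F0 equals the target.

    Working in the log domain keeps the relative shape of the contour
    (intervals in semitones) unchanged while moving its overall level.
    """
    if target_mean_hz <= 0:
        raise ValueError("target_mean_hz must be positive")
    voiced = [f for f in f0_hz if f > 0]
    if not voiced:
        # Nothing to modify in a fully unvoiced trajectory.
        return list(f0_hz)
    log_mean = sum(math.log(f) for f in voiced) / len(voiced)
    offset = math.log(target_mean_hz) - log_mean
    return [math.exp(math.log(f) + offset) if f > 0 else 0.0 for f in f0_hz]


# Hypothetical frame-level trajectory in Hz (0.0 marks unvoiced frames).
trajectory = [0.0, 100.0, 400.0, 0.0]
raised = scale_f0(trajectory, 1.5)
relocated = shift_log_f0_mean(trajectory, 100.0)
```

In a pipeline like the one the abstract describes, a modified trajectory of this kind would be passed to the waveform model in place of the original F0, alongside the PPG and the modified X-vector.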

Taxonomy

Topics: Speech Recognition and Synthesis · Speech and Audio Processing · Music and Audio Processing