First Steps Towards Voice Anonymization for Code-Switching Speech
Sarina Meyer, Ekaterina Kolos, Ngoc Thang Vu

TL;DR
This paper explores voice anonymization for code-switching speech, adapting multilingual models to protect speaker identity while addressing unique challenges posed by spontaneous, multilingual data.
Contribution
It introduces the first investigation into voice anonymization for code-switching speech and adapts a multilingual model for this purpose.
Findings
Multilingual system performs well in privacy and utility.
Language-independent methods are less effective.
Challenges exist in utility evaluation due to data spontaneity.
Abstract
The goal of voice anonymization is to modify an audio such that the true identity of its speaker is hidden. Research on this task is typically limited to the same English read speech datasets, thus the efficacy of current methods for other types of speech data remains unknown. In this paper, we present the first investigation of voice anonymization for the multilingual phenomenon of code-switching speech. We prepare two corpora for this task and propose adaptations to a multilingual anonymization model to make it applicable for code-switching speech. By testing the anonymization performance of this and two language-independent methods on the datasets, we find that only the multilingual system performs well in terms of privacy and utility preservation. Furthermore, we observe challenges in performing utility evaluations on this data because of its spontaneous character and the limited…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Voice and Speech Disorders · Speech and Audio Processing
