Probing the Feasibility of Multilingual Speaker Anonymization

Sarina Meyer; Florian Lux; Ngoc Thang Vu

arXiv:2407.02937·cs.CL·October 8, 2024

Probing the Feasibility of Multilingual Speaker Anonymization

Sarina Meyer, Florian Lux, Ngoc Thang Vu

PDF

Open Access 1 Repo

TL;DR

This paper explores extending multilingual speaker anonymization technology beyond English, demonstrating its effectiveness across nine languages and highlighting the importance of speech synthesis quality for privacy protection.

Contribution

It introduces a multilingual extension of a speaker anonymization system, enabling privacy preservation across nine languages using language-independent speaker embeddings.

Findings

01

Successful anonymization across nine languages.

02

Speaker embeddings trained on English generalize to other languages.

03

Speech synthesis quality impacts anonymization effectiveness.

Abstract

In speaker anonymization, speech recordings are modified in a way that the identity of the speaker remains hidden. While this technology could help to protect the privacy of individuals around the globe, current research restricts this by focusing almost exclusively on English data. In this study, we extend a state-of-the-art anonymization system to nine languages by transforming language-dependent components to their multilingual counterparts. Experiments testing the robustness of the anonymized speech against privacy attacks and speech deterioration show an overall success of this system for all languages. The results suggest that speaker embeddings trained on English data can be applied across languages, and that the anonymization performance for a language is mainly affected by the quality of the speech synthesis component used for it.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

digitalphonetics/speaker-anonymization
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis