The VoicePrivacy 2022 Challenge: Progress and Perspectives in Voice   Anonymisation

Michele Panariello; Natalia Tomashenko; Xin Wang; Xiaoxiao Miao,; Pierre Champion; Hubert Nourtel; Massimiliano Todisco; Nicholas Evans,; Emmanuel Vincent; Junichi Yamagishi

arXiv:2407.11516·eess.AS·July 17, 2024·IEEE ACM Trans. Audio Speech Lang. Process.

The VoicePrivacy 2022 Challenge: Progress and Perspectives in Voice Anonymisation

Michele Panariello, Natalia Tomashenko, Xin Wang, Xiaoxiao Miao,, Pierre Champion, Hubert Nourtel, Massimiliano Todisco, Nicholas Evans,, Emmanuel Vincent, Junichi Yamagishi

PDF

Open Access

TL;DR

The VoicePrivacy 2022 Challenge evaluates voice anonymisation methods, revealing trade-offs between privacy and utility, with voice conversion and hybrid approaches showing promising results.

Contribution

This paper provides a comprehensive overview and analysis of the second VoicePrivacy Challenge, including datasets, attack models, evaluation metrics, and system performances.

Findings

01

Voice conversion methods better preserve utility.

02

Hybrid approaches combining ASR and synthesis enhance privacy.

03

A fundamental privacy-utility trade-off persists in current solutions.

Abstract

The VoicePrivacy Challenge promotes the development of voice anonymisation solutions for speech technology. In this paper we present a systematic overview and analysis of the second edition held in 2022. We describe the voice anonymisation task and datasets used for system development and evaluation, present the different attack models used for evaluation, and the associated objective and subjective metrics. We describe three anonymisation baselines, provide a summary description of the anonymisation systems developed by challenge participants, and report objective and subjective evaluation results for all. In addition, we describe post-evaluation analyses and a summary of related work reported in the open literature. Results show that solutions based on voice conversion better preserve utility, that an alternative which combines automatic speech recognition with synthesis achieves…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis