A Speech Representation Anonymization Framework via Selective Noise   Perturbation

Minh Tran; Mohammad Soleymani

arXiv:2203.14171·eess.AS·October 31, 2022·1 cites

A Speech Representation Anonymization Framework via Selective Noise Perturbation

Minh Tran, Mohammad Soleymani

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel speech anonymization framework that uses selective noise perturbation on high-utility speech representations, balancing privacy and utility without retraining components.

Contribution

It presents a new privacy-preserving speech anonymization method based on noise perturbation guided by a Transformer-based saliency estimator, outperforming existing baselines.

Findings

01

Achieves comparable or better utility than VoicePrivacy2022 baselines.

02

Provides flexible privacy-utility trade-offs without re-training.

03

Maintains privacy levels comparable to existing methods.

Abstract

Privacy and security are major concerns when communicating speech signals to cloud services such as automatic speech recognition (ASR) and speech emotion recognition (SER). Existing solutions for speech anonymization mainly focus on voice conversion or voice modification to convert a raw utterance into another one with similar content but different, or no, identity-related information. However, an alternative approach to share speech data under the form of privacy-preserving representation has been largely under-explored. In this paper, we propose a speech anonymization framework that achieves privacy via noise perturbation to a selected subset of the high-utility representations extracted using a pre-trained speech encoder. The subset is chosen with a Transformer-based privacy-risk saliency estimator. We validate our framework on four tasks, namely, Automatic Speaker Verification…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mtran14/dp_w2v2
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis