Protecting gender and identity with disentangled speech representations

Dimitrios Stoidis; Andrea Cavallaro

arXiv:2104.11051·cs.SD·June 18, 2021

Protecting gender and identity with disentangled speech representations

Dimitrios Stoidis, Andrea Cavallaro

PDF

TL;DR

This paper introduces a variational autoencoder approach that disentangles gender and identity information in speech to enhance privacy, effectively preventing gender and speaker verification from being inferred.

Contribution

It presents a novel method for disentangling gender and identity in speech representations using a variational autoencoder, improving privacy preservation.

Findings

01

Gender recognition accuracy drops to random chance

02

Speaker verification accuracy drops to random chance

03

Effective protection against classification-based attacks

Abstract

Besides its linguistic content, our speech is rich in biometric information that can be inferred by classifiers. Learning privacy-preserving representations for speech signals enables downstream tasks without sharing unnecessary, private information about an individual. In this paper, we show that protecting gender information in speech is more effective than modelling speaker-identity information only when generating a non-sensitive representation of speech. Our method relies on reconstructing speech by decoding linguistic content along with gender information using a variational autoencoder. Specifically, we exploit disentangled representation learning to encode information about different attributes into separate subspaces that can be factorised independently. We present a novel way to encode gender information and disentangle two sensitive biometric identifiers, namely gender and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.