On the invertibility of a voice privacy system using embedding alignement
Pierre Champion (MULTISPEECH, LIUM), Thomas Thebaud (LIUM), Ga\"el Le, Lan, Anthony Larcher (LIUM), Denis Jouvet (MULTISPEECH)

TL;DR
This paper demonstrates that voice anonymization systems based on embedding transformations can be approximately inverted using rotation alignment techniques, recovering a significant portion of original speaker identities.
Contribution
It introduces a method using Wasserstein-Procrustes and Procrustes analysis to approximate and invert voice anonymization transformations as rotations.
Findings
Rotation approximation can recover up to 62% of speaker identities.
Complex anonymization systems can be modeled as rotations in embedding space.
Embedding alignment techniques reveal vulnerabilities in voice privacy systems.
Abstract
This paper explores various attack scenarios on a voice anonymization system using embeddings alignment techniques. We use Wasserstein-Procrustes (an algorithm initially designed for unsupervised translation) or Procrustes analysis to match two sets of x-vectors, before and after voice anonymization, to mimic this transformation as a rotation function. We compute the optimal rotation and compare the results of this approximation to the official Voice Privacy Challenge results. We show that a complex system like the baseline of the Voice Privacy Challenge can be approximated by a rotation, estimated using a limited set of x-vectors. This paper studies the space of solutions for voice anonymization within the specific scope of rotations. Rotations being reversible, the proposed method can recover up to 62% of the speaker identities from anonymized embeddings.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInternet Traffic Analysis and Secure E-voting · Privacy-Preserving Technologies in Data
MethodsProcrustes
