SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation

Yuanjun Lv; Jixun Yao; Peikun Chen; Hongbin Zhou; Heng Lu; Lei Xie

arXiv:2310.05051·cs.SD·October 10, 2023

TL;DR

SALT introduces a novel speaker anonymization method using latent space interpolation and extrapolation, achieving high speaker distinctiveness while maintaining speech quality and intelligibility, especially for out-of-distribution speakers.

Contribution

The paper presents SALT, a new speaker anonymization system leveraging latent space transformation with interpolation and extrapolation techniques for improved diversity and distinctiveness.

Findings

01. Achieves state-of-the-art speaker distinctiveness metrics.
02. Maintains speech quality and intelligibility.
03. Effective for out-of-distribution speakers.

Abstract

Speaker anonymization aims to conceal a speaker's identity without degrading speech quality and intelligibility. Most speaker anonymization systems disentangle the speaker representation from the original speech and achieve anonymization by averaging or modifying the speaker representation. However, the anonymized speech suffers from reduced pseudo-speaker distinctiveness, speech quality, and intelligibility for out-of-distribution speakers. To solve this issue, we propose SALT, a Speaker Anonymization system based on Latent space Transformation. Specifically, we extract latent features with a self-supervised feature extractor, randomly sample multiple speakers and their weights, and then interpolate the latent vectors to achieve speaker anonymization. Meanwhile, we explore an extrapolation method to further extend the diversity of pseudo speakers. Experiments on Voice Privacy…
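
The sampling-and-interpolation step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the uniform weight sampling, and the specific form of extrapolation (stretching the weights away from the uniform mix while keeping their sum at 1) are all assumptions made for illustration, and plain Python lists stand in for the latent vectors that a self-supervised feature extractor would produce.

```python
import random


def sample_weights(n, rng, extrapolation=0.0):
    """Sample n mixing weights that sum to 1 (hypothetical helper).

    extrapolation=0.0 yields a convex combination (interpolation).
    extrapolation>0 stretches the weights away from the uniform mix, so
    individual weights may leave [0, 1] while the sum stays 1. This form
    of extrapolation is an assumption, not taken from the paper.
    """
    raw = [rng.random() + 1e-9 for _ in range(n)]
    total = sum(raw)
    convex = [r / total for r in raw]
    uniform = 1.0 / n
    return [uniform + (1.0 + extrapolation) * (w - uniform) for w in convex]


def mix_latents(latents, weights):
    """Return the weighted sum of equally sized latent vectors."""
    dim = len(latents[0])
    return [sum(w * vec[i] for w, vec in zip(weights, latents)) for i in range(dim)]


def make_pseudo_speaker(pool, k, rng, extrapolation=0.0):
    """Pick k speakers from the pool at random and blend their latents."""
    chosen = rng.sample(pool, k)
    weights = sample_weights(k, rng, extrapolation)
    return mix_latents(chosen, weights), weights


if __name__ == "__main__":
    rng = random.Random(0)
    # Toy 2-dimensional "latents" for three speakers (made-up values).
    pool = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    latent, weights = make_pseudo_speaker(pool, k=2, rng=rng)
    print("interpolated:", latent, "weights:", weights)
    latent, weights = make_pseudo_speaker(pool, k=2, rng=rng, extrapolation=1.5)
    print("extrapolated:", latent, "weights:", weights)
```

With `extrapolation=0.0` the blended vector stays inside the convex hull of the sampled speakers; a positive value lets it move outside that hull, which is one plausible way to widen the range of pseudo speakers.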

Code & Models

Repositories

bakerbunker/salt (PyTorch, official)

Taxonomy

Topics: Speech Recognition and Synthesis · Speech and Audio Processing · Voice and Speech Disorders