You Are What You Say: Exploiting Linguistic Content for VoicePrivacy Attacks

\"Unal Ege Gaznepoglu; Anna Leschanowsky; Ahmad Aloradi; Prachi Singh; Daniel Tenbrinck; Emanu\"el A. P. Habets; Nils Peters

arXiv:2506.09521·eess.AS·May 21, 2026

You Are What You Say: Exploiting Linguistic Content for VoicePrivacy Attacks

\"Unal Ege Gaznepoglu, Anna Leschanowsky, Ahmad Aloradi, Prachi Singh, Daniel Tenbrinck, Emanu\"el A. P. Habets, Nils Peters

PDF

TL;DR

This paper demonstrates that linguistic content can significantly compromise speaker anonymization systems, revealing privacy vulnerabilities through adapted language models and suggesting dataset improvements.

Contribution

It introduces a BERT-based ASV attack that exposes linguistic content vulnerabilities in speaker anonymization, highlighting the need for better dataset design and evaluation metrics.

Findings

01

Achieved a mean EER of 35% using linguistic content-based attack.

02

Certain speakers had EERs as low as 2%, indicating high privacy risk.

03

Revealed that dataset curation influences attack success and evaluation fairness.

Abstract

Speaker anonymization systems hide the identity of speakers while preserving other information such as linguistic content and emotions. To evaluate their privacy benefits, attacks in the form of automatic speaker verification (ASV) systems are employed. In this study, we assess the impact of intra-speaker linguistic content similarity in the attacker training and evaluation datasets, by adapting BERT, a language model, as an ASV system. On the VoicePrivacy Attacker Challenge datasets, our method achieves a mean equal error rate (EER) of 35%, with certain speakers attaining EERs as low as 2%, based solely on the textual content of their utterances. Our explainability study reveals that the system decisions are linked to semantically similar keywords within utterances, stemming from how LibriSpeech is curated. Our study suggests reworking the VoicePrivacy datasets to ensure a fair and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Topic Modeling · Hate Speech and Cyberbullying Detection

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Layer Normalization · Linear Warmup With Linear Decay · Attention Dropout · Softmax · Linear Layer · Dropout · Dense Connections · Attention Is All You Need · WordPiece