SpecWav-Attack: Leveraging Spectrogram Resizing and Wav2Vec 2.0 for Attacking Anonymized Speech

Yuqi Li; Yuanzhong Zheng; Zhongtian Guo; Yaoxuan Wang; Jianjun Yin; Haojun Fei

arXiv:2505.09616·cs.SD·May 16, 2025

SpecWav-Attack: Leveraging Spectrogram Resizing and Wav2Vec 2.0 for Attacking Anonymized Speech

Yuqi Li, Yuanzhong Zheng, Zhongtian Guo, Yaoxuan Wang, Jianjun Yin, Haojun Fei

PDF

Open Access

TL;DR

This paper introduces SpecWav-Attack, an adversarial approach that combines spectrogram resizing and Wav2Vec 2.0 to effectively attack anonymized speech systems, exposing their vulnerabilities and highlighting the need for enhanced defenses.

Contribution

It proposes a novel adversarial attack method leveraging spectrogram resizing and Wav2Vec 2.0, outperforming existing attacks on anonymized speech datasets.

Findings

01

Outperforms conventional attacks on librispeech datasets

02

Reveals vulnerabilities in anonymized speech systems

03

Highlights the need for stronger defenses

Abstract

This paper presents SpecWav-Attack, an adversarial model for detecting speakers in anonymized speech. It leverages Wav2Vec2 for feature extraction and incorporates spectrogram resizing and incremental training for improved performance. Evaluated on librispeech-dev and librispeech-test, SpecWav-Attack outperforms conventional attacks, revealing vulnerabilities in anonymized speech systems and emphasizing the need for stronger defenses, benchmarked against the ICASSP 2025 Attacker Challenge.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection