SpecWav-Attack: Leveraging Spectrogram Resizing and Wav2Vec 2.0 for Attacking Anonymized Speech
Yuqi Li, Yuanzhong Zheng, Zhongtian Guo, Yaoxuan Wang, Jianjun Yin, Haojun Fei

TL;DR
This paper introduces SpecWav-Attack, an adversarial approach that combines spectrogram resizing and Wav2Vec 2.0 to effectively attack anonymized speech systems, exposing their vulnerabilities and highlighting the need for enhanced defenses.
Contribution
It proposes a novel adversarial attack method leveraging spectrogram resizing and Wav2Vec 2.0, outperforming existing attacks on anonymized speech datasets.
Findings
Outperforms conventional attacks on librispeech datasets
Reveals vulnerabilities in anonymized speech systems
Highlights the need for stronger defenses
Abstract
This paper presents SpecWav-Attack, an adversarial model for detecting speakers in anonymized speech. It leverages Wav2Vec2 for feature extraction and incorporates spectrogram resizing and incremental training for improved performance. Evaluated on librispeech-dev and librispeech-test, SpecWav-Attack outperforms conventional attacks, revealing vulnerabilities in anonymized speech systems and emphasizing the need for stronger defenses, benchmarked against the ICASSP 2025 Attacker Challenge.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection
