Timbre-reserved Adversarial Attack in Speaker Identification
Qing Wang, Jixun Yao, Li Zhang, Pengcheng Guo, and Lei Xie

TL;DR
This paper introduces a novel adversarial attack method for speaker identification systems that preserves the target speaker's timbre while effectively fooling the system, enhancing attack specificity.
Contribution
It proposes a timbre-reserved adversarial attack framework using adversarial constraints during voice conversion training to generate targeted, speaker-specific adversarial audio.
Findings
Successfully fools SID systems with preserved speaker timbre
Outperforms existing adversarial attack methods in specificity
Demonstrates robustness across different SID models
Abstract
As a type of biometric identification, a speaker identification (SID) system is confronted with various kinds of attacks. The spoofing attacks typically imitate the timbre of the target speakers, while the adversarial attacks confuse the SID system by adding a well-designed adversarial perturbation to an arbitrary speech. Although the spoofing attack copies a similar timbre as the victim, it does not exploit the vulnerability of the SID model and may not make the SID system give the attacker's desired decision. As for the adversarial attack, despite the SID system can be led to a designated decision, it cannot meet the specified text or speaker timbre requirements for the specific attack scenarios. In this study, to make the attack in SID not only leverage the vulnerability of the SID model but also reserve the timbre of the target speaker, we propose a timbre-reserved adversarial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Speech Recognition and Synthesis · Digital Media Forensic Detection
