Timbre-reserved Adversarial Attack in Speaker Identification

Qing Wang; Jixun Yao; Li Zhang; Pengcheng Guo; and Lei Xie

arXiv:2309.00929·cs.SD·September 6, 2023

Timbre-reserved Adversarial Attack in Speaker Identification

Qing Wang, Jixun Yao, Li Zhang, Pengcheng Guo, and Lei Xie

PDF

Open Access

TL;DR

This paper introduces a novel adversarial attack method for speaker identification systems that preserves the target speaker's timbre while effectively fooling the system, enhancing attack specificity.

Contribution

It proposes a timbre-reserved adversarial attack framework using adversarial constraints during voice conversion training to generate targeted, speaker-specific adversarial audio.

Findings

01

Successfully fools SID systems with preserved speaker timbre

02

Outperforms existing adversarial attack methods in specificity

03

Demonstrates robustness across different SID models

Abstract

As a type of biometric identification, a speaker identification (SID) system is confronted with various kinds of attacks. The spoofing attacks typically imitate the timbre of the target speakers, while the adversarial attacks confuse the SID system by adding a well-designed adversarial perturbation to an arbitrary speech. Although the spoofing attack copies a similar timbre as the victim, it does not exploit the vulnerability of the SID model and may not make the SID system give the attacker's desired decision. As for the adversarial attack, despite the SID system can be led to a designated decision, it cannot meet the specified text or speaker timbre requirements for the specific attack scenarios. In this study, to make the attack in SID not only leverage the vulnerability of the SID model but also reserve the timbre of the target speaker, we propose a timbre-reserved adversarial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Speech Recognition and Synthesis · Digital Media Forensic Detection