Symmetric Saliency-based Adversarial Attack To Speaker Identification

Jiadi Yao; Xing Chen; Xiao-Lei Zhang; Wei-Qiang Zhang; Kunde Yang

arXiv:2210.16777·cs.SD·February 28, 2023·1 cites

Symmetric Saliency-based Adversarial Attack To Speaker Identification

Jiadi Yao, Xing Chen, Xiao-Lei Zhang, Wei-Qiang Zhang, Kunde Yang

PDF

Open Access

TL;DR

This paper introduces a novel symmetric saliency-based encoder-decoder approach for generating adversarial voice examples that effectively attack speaker identification systems with high success rates and low computational cost.

Contribution

It proposes a new generation network with saliency map decoding and an angular loss function, achieving state-of-the-art attack success rates on speaker identification tasks.

Findings

01

Over 97% targeted attack success rate

02

Signal-to-noise ratio over 39 dB

03

Low computational cost

Abstract

Adversarial attack approaches to speaker identification either need high computational cost or are not very effective, to our knowledge. To address this issue, in this paper, we propose a novel generation-network-based approach, called symmetric saliency-based encoder-decoder (SSED), to generate adversarial voice examples to speaker identification. It contains two novel components. First, it uses a novel saliency map decoder to learn the importance of speech samples to the decision of a targeted speaker identification system, so as to make the attacker focus on generating artificial noise to the important samples. It also proposes an angular loss function to push the speaker embedding far away from the source speaker. Our experimental results demonstrate that the proposed SSED yields the state-of-the-art performance, i.e. over 97% targeted attack success rate and a signal-to-noise level…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Adversarial Robustness in Machine Learning · Digital Media Forensic Detection