Interpretable Spectrum Transformation Attacks to Speaker Recognition

Jiadi Yao; Hong Luo; and Xiao-Lei Zhang

arXiv:2302.10686·cs.SD·February 22, 2023

Interpretable Spectrum Transformation Attacks to Speaker Recognition

Jiadi Yao, Hong Luo, and Xiao-Lei Zhang

PDF

Open Access

TL;DR

This paper introduces a spectral transformation attack framework using modified discrete cosine transform to enhance transferability and interpretability of adversarial voices in speaker recognition, with visual explanations via saliency maps.

Contribution

It proposes a novel time-frequency domain attack method, STA-MDCT, improving transferability and interpretability over existing time-domain approaches in speaker recognition adversarial attacks.

Findings

01

STA-MDCT significantly outperforms existing attack methods.

02

Saliency maps provide clear interpretability of attack success.

03

Ensemble of surrogate models enhances attack transferability.

Abstract

The success of adversarial attacks to speaker recognition is mainly in white-box scenarios. When applying the adversarial voices that are generated by attacking white-box surrogate models to black-box victim models, i.e. \textit{transfer-based} black-box attacks, the transferability of the adversarial voices is not only far from satisfactory, but also lacks interpretable basis. To address these issues, in this paper, we propose a general framework, named spectral transformation attack based on modified discrete cosine transform (STA-MDCT), to improve the transferability of the adversarial voices to a black-box victim model. Specifically, we first apply MDCT to the input voice. Then, we slightly modify the energy of different frequency bands for capturing the salient regions of the adversarial noise in the time-frequency domain that are critical to a successful attack. Unlike existing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Speech Recognition and Synthesis · Anomaly Detection Techniques and Applications

MethodsDiscrete Cosine Transform · Class-activation map