Interpretable Spectrum Transformation Attacks to Speaker Recognition
Jiadi Yao, Hong Luo, and Xiao-Lei Zhang

TL;DR
This paper introduces a spectral transformation attack framework using modified discrete cosine transform to enhance transferability and interpretability of adversarial voices in speaker recognition, with visual explanations via saliency maps.
Contribution
It proposes a novel time-frequency domain attack method, STA-MDCT, improving transferability and interpretability over existing time-domain approaches in speaker recognition adversarial attacks.
Findings
STA-MDCT significantly outperforms existing attack methods.
Saliency maps provide clear interpretability of attack success.
Ensemble of surrogate models enhances attack transferability.
Abstract
The success of adversarial attacks to speaker recognition is mainly in white-box scenarios. When applying the adversarial voices that are generated by attacking white-box surrogate models to black-box victim models, i.e. \textit{transfer-based} black-box attacks, the transferability of the adversarial voices is not only far from satisfactory, but also lacks interpretable basis. To address these issues, in this paper, we propose a general framework, named spectral transformation attack based on modified discrete cosine transform (STA-MDCT), to improve the transferability of the adversarial voices to a black-box victim model. Specifically, we first apply MDCT to the input voice. Then, we slightly modify the energy of different frequency bands for capturing the salient regions of the adversarial noise in the time-frequency domain that are critical to a successful attack. Unlike existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Speech Recognition and Synthesis · Anomaly Detection Techniques and Applications
MethodsDiscrete Cosine Transform · Class-activation map
