SA: Sliding attack for synthetic speech detection with resistance to   clipping and self-splicing

Deng JiaCheng; Dong Li; Yan Diqun; Wang Rangding; Zeng Jiaming

arXiv:2208.13066·cs.SD·October 13, 2022

SA: Sliding attack for synthetic speech detection with resistance to clipping and self-splicing

Deng JiaCheng, Dong Li, Yan Diqun, Wang Rangding, Zeng Jiaming

PDF

Open Access

TL;DR

This paper introduces the sliding attack, a new adversarial attack method for synthetic speech detection that improves transferability of adversarial examples across different input lengths and models, especially under clipping and self-splicing conditions.

Contribution

The paper proposes a novel sliding attack method that enhances transferability of adversarial examples in audio models with varying input lengths and features, addressing practical clipping and self-splicing scenarios.

Findings

01

Significantly improves transferability of adversarial examples after clipping or self-splicing.

02

Enhances transferability between models based on different features.

03

Effective against state-of-the-art synthetic speech detection models.

Abstract

Deep neural networks are vulnerable to adversarial examples that mislead models with imperceptible perturbations. In audio, although adversarial examples have achieved incredible attack success rates on white-box settings and black-box settings, most existing adversarial attacks are constrained by the input length. A More practical scenario is that the adversarial examples must be clipped or self-spliced and input into the black-box model. Therefore, it is necessary to explore how to improve transferability in different input length settings. In this paper, we take the synthetic speech detection task as an example and consider two representative SOTA models. We observe that the gradients of fragments with the same sample value are similar in different models via analyzing the gradients obtained by feeding samples into the model after cropping or self-splicing. Inspired by the above…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Speech Recognition and Synthesis · Advanced Neural Network Applications