Multi-Discriminator Sobolev Defense-GAN Against Adversarial Attacks for End-to-End Speech Systems

Mohammad Esmaeilpour; Patrick Cardinal; Alessandro Lameiras Koerich

arXiv:2103.08086 · cs.SD · March 16, 2021

TL;DR

This paper presents a defense for speech-to-text systems against adversarial attacks. It combines a spectrogram subspace projection with Sobolev GAN-based spectrogram synthesis to improve robustness while preserving signal quality.

Contribution

The paper introduces a new Sobolev GAN architecture and a spectrogram subspace projection method for improved adversarial defense in speech systems.

Findings

01. Outperforms state-of-the-art defenses in accuracy.

02. Effective against six strong white-box and black-box attacks.

03. Maintains high signal quality and stability.

Abstract

This paper introduces a defense approach against end-to-end adversarial attacks developed for cutting-edge speech-to-text systems. The proposed defense algorithm has four major steps. First, we represent speech signals with 2D spectrograms using the short-time Fourier transform. Second, we iteratively find a safe vector using a spectrogram subspace projection operation. This operation minimizes the chordal distance adjustment between spectrograms with an additional regularization term. Third, we synthesize a spectrogram with such a safe vector using a novel GAN architecture trained with the Sobolev integral probability metric. To improve the model's performance in terms of stability and the total number of learned modes, we impose an additional constraint on the generator network. Finally, we reconstruct the signal from the synthesized spectrogram and the Griffin-Lim phase approximation…
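The first and last steps of the abstract (short-time Fourier transform and Griffin-Lim phase approximation) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `stft`, `istft`, and `griffin_lim`, the Hann window, the frame size `n_fft=256`, the hop of 64 samples, and the iteration count are all assumptions chosen for illustration. The subspace projection and the Sobolev GAN (steps two and three) are not modeled here; the magnitude spectrogram of a test tone stands in for the synthesized spectrogram.

```python
import numpy as np

def stft(x, n_fft=256, hop=64):
    """Complex STFT with a Hann window; each row is one frame."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack(
        [x[i * hop : i * hop + n_fft] * window for i in range(n_frames)]
    )
    return np.fft.rfft(frames, axis=1)

def istft(S, n_fft=256, hop=64):
    """Weighted overlap-add inverse of stft() above."""
    window = np.hanning(n_fft)
    n_frames = S.shape[0]
    length = n_fft + hop * (n_frames - 1)
    x = np.zeros(length)
    norm = np.zeros(length)
    frames = np.fft.irfft(S, n=n_fft, axis=1)
    for i in range(n_frames):
        sl = slice(i * hop, i * hop + n_fft)
        x[sl] += frames[i] * window
        norm[sl] += window ** 2
    # Guard against division by zero where the window vanishes.
    return x / np.maximum(norm, 1e-8)

def griffin_lim(mag, n_iter=50, n_fft=256, hop=64, seed=0):
    """Estimate a waveform whose STFT magnitude approximates `mag`."""
    rng = np.random.default_rng(seed)
    # Start from a random phase (an assumption; other initializations exist).
    phase = np.exp(2j * np.pi * rng.random(mag.shape))
    for _ in range(n_iter):
        x = istft(mag * phase, n_fft, hop)   # impose the target magnitude
        S = stft(x, n_fft, hop)              # project onto valid STFTs
        phase = S / np.maximum(np.abs(S), 1e-8)
    return istft(mag * phase, n_fft, hop)

# Illustrative input: one second of a 440 Hz tone at an assumed 8 kHz rate.
sr = 8000
t = np.arange(sr) / sr
signal = np.sin(2 * np.pi * 440.0 * t)

mag = np.abs(stft(signal))       # step 1: 2D magnitude spectrogram
recovered = griffin_lim(mag)     # step 4: waveform from magnitude only
```

In the pipeline described above, `mag` would be replaced by the spectrogram produced by the generator; the sketch only shows how a waveform can be recovered once a magnitude spectrogram is available.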

Taxonomy

Topics: Hate Speech and Cyberbullying Detection · Speech Recognition and Synthesis · Adversarial Robustness in Machine Learning