Multi-Discriminator Sobolev Defense-GAN Against Adversarial Attacks for End-to-End Speech Systems
Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras, Koerich

TL;DR
This paper presents a novel defense method for speech-to-text systems against adversarial attacks, utilizing a Sobolev GAN-based spectrogram synthesis and a spectrogram subspace projection to enhance robustness and signal quality.
Contribution
The paper introduces a new Sobolev GAN architecture and a spectrogram subspace projection method for improved adversarial defense in speech systems.
Findings
Outperforms state-of-the-art defenses in accuracy.
Effective against six strong white and black-box attacks.
Maintains high signal quality and stability.
Abstract
This paper introduces a defense approach against end-to-end adversarial attacks developed for cutting-edge speech-to-text systems. The proposed defense algorithm has four major steps. First, we represent speech signals with 2D spectrograms using the short-time Fourier transform. Second, we iteratively find a safe vector using a spectrogram subspace projection operation. This operation minimizes the chordal distance adjustment between spectrograms with an additional regularization term. Third, we synthesize a spectrogram with such a safe vector using a novel GAN architecture trained with Sobolev integral probability metric. To improve the model's performance in terms of stability and the total number of learned modes, we impose an additional constraint on the generator network. Finally, we reconstruct the signal from the synthesized spectrogram and the Griffin-Lim phase approximation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Speech Recognition and Synthesis · Adversarial Robustness in Machine Learning
