Class-Conditional Defense GAN Against End-to-End Speech Attacks

Mohammad Esmaeilpour; Patrick Cardinal; Alessandro Lameiras Koerich

arXiv:2010.11352·cs.SD·February 23, 2021

TL;DR

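This paper introduces a class-conditional defense GAN that protects speech-to-text systems such as DeepSpeech and Lingvo against end-to-end adversarial attacks. It reconstructs the input signal from a synthesized spectrogram and the original phase, and is reported to outperform conventional defenses in word error rate and sentence-level recognition accuracy.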

Contribution

The paper presents a novel class-conditional Defense GAN approach that reconstructs speech signals without adding noise, improving defense against end-to-end speech adversarial attacks.

Findings

01 Significantly reduces word error rate under attack

02 Improves sentence recognition accuracy

03 Outperforms conventional defense algorithms

Abstract

In this paper, we propose a novel defense approach against end-to-end adversarial attacks developed to fool advanced speech-to-text systems such as DeepSpeech and Lingvo. Unlike conventional defense approaches, the proposed approach does not directly employ low-level transformations, such as autoencoding a given input signal, to remove potential adversarial perturbation. Instead, we find an optimal input vector for a class-conditional generative adversarial network by minimizing the relative chordal distance adjustment between a given test input and the generator network. Then, we reconstruct the 1D signal from the synthesized spectrogram and the original phase information derived from the given input signal. Hence, this reconstruction does not add any extra noise to the signal and, according to our experimental results, our defense-GAN considerably outperforms…
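
The reconstruction step described in the abstract, which combines a synthesized spectrogram with the phase of the given input signal, can be sketched as follows. This is a minimal NumPy illustration under stated assumptions, not the authors' implementation: the helper names (`stft`, `istft`, `reconstruct_with_original_phase`), the Hann window, and the frame and hop sizes are all hypothetical choices, and the synthesized magnitude is assumed to have the same shape as the STFT of the input signal. The generator and the latent-vector search are not shown.

```python
import numpy as np


def stft(x, n_fft=512, hop=128):
    """Short-time Fourier transform with a Hann window (assumed settings)."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack(
        [x[i * hop : i * hop + n_fft] * window for i in range(n_frames)]
    )
    # Result has shape (n_frames, n_fft // 2 + 1), complex-valued.
    return np.fft.rfft(frames, axis=1)


def istft(spec, n_fft=512, hop=128):
    """Inverse STFT by weighted overlap-add with window-energy normalization."""
    window = np.hanning(n_fft)
    n_frames = spec.shape[0]
    length = n_fft + hop * (n_frames - 1)
    out = np.zeros(length)
    norm = np.zeros(length)
    frames = np.fft.irfft(spec, n=n_fft, axis=1)
    for i in range(n_frames):
        start = i * hop
        out[start : start + n_fft] += frames[i] * window
        norm[start : start + n_fft] += window ** 2
    # Guard against division by ~0 at the signal edges, where the window vanishes.
    return out / np.maximum(norm, 1e-8)


def reconstruct_with_original_phase(synth_magnitude, input_signal,
                                    n_fft=512, hop=128):
    """Combine a synthesized magnitude spectrogram with the input's phase.

    `synth_magnitude` stands in for the generator output (hypothetical here);
    it must match the shape of `stft(input_signal, n_fft, hop)`.
    """
    phase = np.angle(stft(input_signal, n_fft, hop))
    complex_spec = synth_magnitude * np.exp(1j * phase)
    return istft(complex_spec, n_fft, hop)
```

As a sanity check on the sketch, passing the input's own magnitude spectrogram as `synth_magnitude` should return the input signal away from the edges, since no information has been altered. The first and last frames are attenuated by the window, so comparisons are best made on interior samples.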
