Adversarial Attacks in Sound Event Classification
Vinod Subramanian, Emmanouil Benetos, Ning Xu, SKoT McDonald, Mark, Sandler

TL;DR
This paper demonstrates that gradient-based adversarial attacks can effectively fool various deep learning models in sound event classification, even with minimal input perturbations.
Contribution
It applies multiple adversarial attack algorithms to diverse sound classification models and evaluates their effectiveness across different architectures.
Findings
Adversarial attacks can be generated with high confidence and low perturbation.
Attacks are highly effective across different model architectures.
Abstract
Adversarial attacks refer to a set of methods that perturb the input to a classification model in order to fool the classifier. In this paper we apply different gradient based adversarial attack algorithms on five deep learning models trained for sound event classification. Four of the models use mel-spectrogram input and one model uses raw audio input. The models represent standard architectures such as convolutional, recurrent and dense networks. The dataset used for training is the Freesound dataset released for task 2 of the DCASE 2018 challenge and the models used are from participants of the challenge who open sourced their code. Our experiments show that adversarial attacks can be generated with high confidence and low perturbation. In addition, we show that the adversarial attacks are very effective across the different models.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Digital Media Forensic Detection · Anomaly Detection Techniques and Applications
