Deep Learning and Music Adversaries

Corey Kereliuk; Bob L. Sturm; Jan Larsen

arXiv:1507.04761·cs.LG·July 20, 2015

Deep Learning and Music Adversaries

Corey Kereliuk, Bob L. Sturm, Jan Larsen

PDF

TL;DR

This paper explores creating adversarial attacks on deep learning models for music content analysis, demonstrating their effectiveness and comparing robustness across architectures, while also testing adversarial training.

Contribution

It adapts image-based adversarial techniques to audio spectral data and evaluates their impact on different deep learning architectures for music analysis.

Findings

01

Adversaries effectively fool music classification systems.

02

Convolutional networks are more robust than frame-based voting systems.

03

Adversarial training did not improve model resilience.

Abstract

An adversary is essentially an algorithm intent on making a classification system perform in some particular way given an input, e.g., increase the probability of a false negative. Recent work builds adversaries for deep learning systems applied to image object recognition, which exploits the parameters of the system to find the minimal perturbation of the input image such that the network misclassifies it with high confidence. We adapt this approach to construct and deploy an adversary of deep learning systems applied to music content analysis. In our case, however, the input to the systems is magnitude spectral frames, which requires special care in order to produce valid input audio signals from network-derived perturbations. For two different train-test partitionings of two benchmark datasets, and two different deep architectures, we find that this adversary is very effective in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.