Explaining Deep Convolutional Neural Networks on Music Classification
Keunwoo Choi, George Fazekas, Mark Sandler

TL;DR
This paper introduces a method to interpret deep CNNs used in music classification by converting learned features into audio signals, revealing how features evolve across layers and what they capture.
Contribution
It presents a novel auralisation technique for CNNs in music IR, enabling visualization of learned features and understanding their evolution across layers.
Findings
Deep layers capture textures and continuous patterns rather than shapes.
Deconvolution-based auralisation reveals the learned features in audio form.
Features evolve from simple patterns to complex textures across layers.
Abstract
Deep convolutional neural networks (CNNs) have been actively adopted in the field of music information retrieval, e.g. genre classification, mood detection, and chord recognition. However, the process of learning and prediction is little understood, particularly when it is applied to spectrograms. We introduce auralisation of a CNN to understand its underlying mechanism, which is based on a deconvolution procedure introduced in [2]. Auralisation of a CNN is converting the learned convolutional features that are obtained from deconvolution into audio signals. In the experiments and discussions, we explain trained features of a 5-layer CNN based on the deconvolved spectrograms and auralised signals. The pairwise correlations per layers with varying different musical attributes are also investigated to understand the evolution of the learnt features. It is shown that in the deep layers,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Generative Adversarial Networks and Image Synthesis · Neural Networks and Applications
