Perceptual Musical Features for Interpretable Audio Tagging
Vassilis Lyberatos, Spyridon Kantarelis, Edmund Dervakos, Giorgos, Stamou

TL;DR
This paper introduces an interpretable music tagging approach combining symbolic knowledge, neural networks, and perceptual features, achieving competitive results while enhancing transparency in audio classification.
Contribution
The study presents a novel workflow integrating multiple interpretability techniques for music tagging, addressing the lack of interpretability in neural network-based audio classification.
Findings
Outperforms baseline models on MTG-Jamendo and GTZAN datasets
Achieves competitive performance with state-of-the-art methods
Highlights the value of interpretability over marginal performance loss
Abstract
In the age of music streaming platforms, the task of automatically tagging music audio has garnered significant attention, driving researchers to devise methods aimed at enhancing performance metrics on standard datasets. Most recent approaches rely on deep neural networks, which, despite their impressive performance, possess opacity, making it challenging to elucidate their output for a given input. While the issue of interpretability has been emphasized in other fields like medicine, it has not received attention in music-related tasks. In this study, we explored the relevance of interpretability in the context of automatic music tagging. We constructed a workflow that incorporates three different information extraction techniques: a) leveraging symbolic knowledge, b) utilizing auxiliary deep neural networks, and c) employing signal processing to extract perceptual features from audio…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Diverse Musicological Studies · Neuroscience and Music Perception
