MusicLIME: Explainable Multimodal Music Understanding

Theodoros Sotirou; Vassilis Lyberatos; Orfeas Menis Mastromichalakis,; Giorgos Stamou

arXiv:2409.10496·cs.SD·March 19, 2025

MusicLIME: Explainable Multimodal Music Understanding

Theodoros Sotirou, Vassilis Lyberatos, Orfeas Menis Mastromichalakis,, Giorgos Stamou

PDF

Open Access 1 Repo

TL;DR

MusicLIME introduces a novel explanation method for multimodal music models, revealing how audio and lyrics interact in decision-making, thereby enhancing interpretability and trust in music understanding systems.

Contribution

It presents MusicLIME, a model-agnostic explanation technique that captures interactions between audio and lyrical features in multimodal music models.

Findings

01

MusicLIME effectively explains multimodal model decisions.

02

It provides both local and global interpretability.

03

The method improves understanding of feature interactions.

Abstract

Multimodal models are critical for music understanding tasks, as they capture the complex interplay between audio and lyrics. However, as these models become more prevalent, the need for explainability grows-understanding how these systems make decisions is vital for ensuring fairness, reducing bias, and fostering trust. In this paper, we introduce MusicLIME, a model-agnostic feature importance explanation method designed for multimodal music models. Unlike traditional unimodal methods, which analyze each modality separately without considering the interaction between them, often leading to incomplete or misleading explanations, MusicLIME reveals how audio and lyrical features interact and contribute to predictions, providing a holistic view of the model's decision-making. Additionally, we enhance local explanations by aggregating them into global explanations, giving users a broader…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

iamtheo2000/musiclime
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Music Technology and Sound Studies