R&B -- Rhythm and Brain: Cross-subject Decoding of Music from Human Brain Activity
Matteo Ferrante, Matteo Ciferri, Nicola Toschi

TL;DR
This paper demonstrates a novel method for decoding and identifying music from human brain activity using fMRI data, advanced neural representations, and cross-subject alignment techniques, achieving state-of-the-art accuracy.
Contribution
It introduces a cross-subject decoding framework that combines functional alignment and pre-trained audio models to identify music from brain activity.
Findings
State-of-the-art music identification accuracy from fMRI data
Effective cross-subject decoding of neural responses to music
Identification of key brain regions involved in music perception
Abstract
Music is a universal phenomenon that profoundly influences human experiences across cultures. This study investigates whether music can be decoded from human brain activity measured with functional MRI (fMRI) during its perception. Leveraging recent advancements in extensive datasets and pre-trained computational models, we construct mappings between neural data and latent representations of musical stimuli. Our approach integrates functional and anatomical alignment techniques to facilitate cross-subject decoding, addressing the challenges posed by the low temporal resolution and signal-to-noise ratio (SNR) in fMRI data. Starting from the GTZan fMRI dataset, where five participants listened to 540 musical stimuli from 10 different genres while their brain activity was recorded, we used the CLAP (Contrastive Language-Audio Pretraining) model to extract latent representations of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeuroscience and Music Perception · Music and Audio Processing
