# Nightingale: A multimodal approach combining EEG signals and audio features to enhance music therapy

**Authors:** Zirui Chen, Nikhil Yadav

PMC · DOI: 10.1371/journal.pone.0341497 · 2026-01-27

## TL;DR

Nightingale uses EEG and audio data to better predict emotional responses to music, improving music therapy effectiveness.

## Contribution

A novel multimodal approach combining EEG and audio features for accurate emotional response prediction in music therapy.

## Key findings

- Nightingale achieved MAPE values of 19.52 for valence and 22.16 for arousal.
- The model has an R squared of 0.67 for valence and arousal domains.
- The approach uses less computational resources than existing state-of-the-art methods.

## Abstract

Music therapy has emerged as a promising, yet underutilized, treatment modality in various clinical settings. This paper presents Nightingale, a novel multimodal approach that integrates audio features with electroencephalogram (EEG) signals to predict a subject’s emotional response to music. The accuracy of emotional response predictions is enhanced using audio features coupled with EEG data extracted from the Dataset for Emotion Analysis using Physiological and Audiovisual Signals (DEAP). These combined modalities are used to develop a Multilayer Perceptron and Convolutional Neural Network architecture, achieving a Mean Absolute Percentage Error (MAPE) values of 19.52 for valence, 22.16 for arousal, and an R squared of 0.67 for the two domains. This approach offers superior performance to the most recent state-of-the-art in the field. Additionally, it requires significantly less computational resources and a simple network structure, providing an evidence-based prediction technique that could make music therapy more effective and accessible.

## Figures

17 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12844538/full.md

---
Source: https://tomesphere.com/paper/PMC12844538