# Towards the Development of a Deep Learning Framework Using Adaptive and Non-Adaptive Time-Frequency Features for EEG-Based Depression Therapy Prediction

**Authors:** Hesam Akbari, Sara Bagherzadeh, Javid Farhadi Sedehi, Rab Nawaz, Reza Rostami, Reza Kazemi, Sadiq Muhammad, Haihua Chen, Mutlu Mete

PMC · DOI: 10.3390/brainsci16030301 · 2026-03-09

## TL;DR

This study develops a deep learning framework using EEG signals to predict whether patients will respond to depression therapies like SSRIs or rTMS before treatment begins.

## Contribution

The study introduces a therapy-specific deep learning framework using adaptive and non-adaptive time-frequency features for depression therapy prediction.

## Key findings

- CWT-based features with ResNet-18 achieved 99.43% accuracy for SSRI outcome prediction.
- VMD-based features with ResNet-18 achieved 98.77% accuracy for rTMS outcome prediction.
- ResNet-18 and TinyViT-Hybrid outperformed other CNN architectures across all conditions.

## Abstract

Background/Objectives: Predicting individual response to depression therapy prior to treatment initiation remains a critical clinical challenge, as the response rate to both selective serotonin reuptake inhibitors (SSRIs) and repetitive transcranial magnetic stimulation (rTMS) is approximately 50%, leaving treatment selection largely trial-based. This study presents a computer-aided decision (CAD) framework that predicts depression therapy outcomes from pre-treatment electroencephalogram (EEG) signals using advanced time-frequency representations and pretrained convolutional neural networks (CNNs). Methods: EEG signals from 30 SSRI patients and 46 rTMS patients are transformed into time-frequency images using Continuous Wavelet Transform (CWT), Variational Mode Decomposition (VMD), and their pixel-level fusion. Four pretrained CNN architectures, including ResNet-18, MobileNet-V3, EfficientNet-B0, and TinyViT-Hybrid, are fine-tuned and evaluated under both image-independent and subject-independent 6-fold cross-validation (CV). Results: Results reveal a clear therapy-specific pattern: CWT-based representations yield superior discrimination for SSRI outcome prediction, with ResNet-18 achieving 99.43% image-level accuracy, while VMD-based representations are statistically superior for rTMS outcome prediction, with ResNet-18 reaching 98.77%. Pixel-level fusion of CWT and VMD does not consistently improve performance over the best individual representation in either therapy context. Pairwise Wilcoxon signed-rank tests confirm a two-tier architectural hierarchy in which ResNet-18 and TinyViT-Hybrid significantly outperform MobileNet-V3 and EfficientNet-B0 across all conditions, while remaining statistically indistinguishable from each other. At the subject level, the framework achieves 82.50% and 83.53% accuracy for SSRI and rTMS, respectively, under strict subject-independent evaluation. Per-channel analysis reveals occipital dominance for SSRI under CWT and frontotemporal dominance for rTMS under VMD, consistent with known neurophysiological mechanisms. Conclusions: These findings demonstrate that the choice of time-frequency representation is therapy-specific and at least as important as architectural complexity, and that competitive performance can be achieved without recurrent or attention layers by combining well-designed spectral images with a simple pretrained residual network.

## Linked entities

- **Diseases:** depression (MONDO:0002050)

## Full-text entities

- **Diseases:** injury to (MESH:D014947), CAD (MESH:C000719218), death (MESH:D003643), Depression (MESH:D003866), mood disorder (MESH:D019964), self-harm (MESH:D012652), VMD (MESH:C537734)
- **Chemicals:** CWT (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

15 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13025224/full.md

---
Source: https://tomesphere.com/paper/PMC13025224