# MultiScaleSleepNet: A Hybrid CNN–BiLSTM–Transformer Architecture with Multi-Scale Feature Representation for Single-Channel EEG Sleep Stage Classification

**Authors:** Cenyu Liu, Qinglin Guan, Wei Zhang, Liyang Sun, Mengyi Wang, Xue Dong, Shuogui Xu

PMC · DOI: 10.3390/s25206328 · Sensors (Basel, Switzerland) · 2025-10-13

## TL;DR

This paper introduces MultiScaleSleepNet, a deep learning model that improves sleep stage classification from single-channel EEG data using a hybrid architecture and multi-scale features.

## Contribution

The novel hybrid CNN–BiLSTM–Transformer architecture with multi-scale feature extraction for efficient sleep staging on wearable devices.

## Key findings

- MultiScaleSleepNet achieved 88.6% accuracy on the Sleep-EDF dataset with 1.9 million parameters.
- Attention mechanisms and spectral fusion significantly improved performance for stages N1, N3, and REM.
- The model showed robustness across different datasets and class distributions.

## Abstract

Accurate automatic sleep stage classification from single-channel EEG remains challenging due to the need for effective extraction of multiscale neurophysiological features and modeling of long-range temporal dependencies. This study aims to address these limitations by developing an efficient and compact deep learning architecture tailored for wearable and edge device applications. We propose MultiScaleSleepNet, a hybrid convolutional neural network–bidirectional long short-term memory–transformer architecture that extracts multiscale temporal and spectral features through parallel convolutional branches, followed by sequential modeling using a BiLSTM memory network and transformer-based attention mechanisms. The model obtained an accuracy, macro-averaged F1 score, and kappa coefficient of 88.6%, 0.833, and 0.84 on the Sleep-EDF dataset; 85.6%, 0.811, and 0.80 on the Sleep-EDF Expanded dataset; and 84.6%, 0.745, and 0.79 on the SHHS dataset. Ablation studies indicate that attention mechanisms and spectral fusion consistently improve performance, with the most notable gains observed for stages N1, N3, and rapid eye movement. MultiScaleSleepNet demonstrates competitive performance across multiple benchmark datasets while maintaining a compact size of 1.9 million parameters, suggesting robustness to variations in dataset size and class distribution. The study supports the feasibility of real-time, accurate sleep staging from single-channel EEG using parameter-efficient deep models suitable for portable systems.

## Full-text entities

- **Diseases:** sleep deprivation (MESH:D012892), RBD (MESH:D020187), -wave (MESH:C535500), chronic insomnia (MESH:D007319), muscle atonia (MESH:D019042), dysregulation of (MESH:D021081), sleep disorders (MESH:D012893), injury to (MESH:D014947), pulmonary, cardiovascular, and coronary diseases (MESH:D002318), impaired cognition (MESH:D003072)
- **Chemicals:** temazepam (MESH:D013693), Epoch (MESH:C079446)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12568011/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12568011/full.md

## References

53 references — full list in the complete paper: https://tomesphere.com/paper/PMC12568011/full.md

---
Source: https://tomesphere.com/paper/PMC12568011