# MASleepNet: A Sleep Staging Model Integrating Multi-Scale Convolution and Attention Mechanisms

**Authors:** Zhiyuan Wang, Zian Gong, Tengjie Wang, Qi Dong, Zhentao Huang, Shanwen Zhang, Yahong Ma

PMC · DOI: 10.3390/biomimetics10100642 · Biomimetics · 2025-09-23

## TL;DR

MASleepNet is a new deep learning model that improves automatic sleep staging by combining multi-scale convolution and attention mechanisms.

## Contribution

The novel integration of multi-scale convolution and attention mechanisms in a multimodal sleep staging model.

## Key findings

- MASleepNet achieved 82.56% accuracy on the Sleep-EDF-78 dataset.
- The model reached 84.53% accuracy on the Sleep-EDF-20 dataset.
- Combining multimodal signals with attention mechanisms improves sleep staging efficiency.

## Abstract

With the rapid development of modern industry, people’s living pressures are gradually increasing, and an increasing number of individuals are affected by sleep disorders such as insomnia, hypersomnia, and sleep apnea syndrome. Many cardiovascular and psychiatric diseases are also closely related to sleep. Therefore, the early detection, accurate diagnosis, and treatment of sleep disorders an urgent research priority. Traditional manual sleep staging methods have many problems, such as being time-consuming and cumbersome, relying on expert experience, or being subjective. To address these issues, researchers have proposed multiple algorithmic strategies for sleep staging automation based on deep learning in recent years. This paper studies MASleepNet, a sleep staging neural network model that integrates multimodal deep features. This model takes multi-channel Polysomnography (PSG) signals (including EEG (Fpz-Cz, Pz-Oz), EOG, and EMG) as input and employs a multi-scale convolutional module to extract features at different time scales in parallel. It then adaptively weights and fuses the features from each modality using a channel-wise attention mechanism. The integrated temporal features are integrated into a Bidirectional Long Short-Term Memory (BiLSTM) sequence encoder, where an attention mechanism is introduced to identify key temporal segments. The final classification result is produced by the fully connected layer. The proposed model was experimentally evaluated on the Sleep-EDF dataset (consisting of two subsets, Sleep-EDF-78 and Sleep-EDF-20), achieving classification accuracies of 82.56% and 84.53% on the two subsets, respectively. These results demonstrate that deep models that integrate multimodal signals and an attention mechanism offer the possibility to enhance the efficiency of automatic sleep staging compared to cutting-edge methods.

## Linked entities

- **Diseases:** insomnia (MONDO:0013600), hypersomnia (MONDO:0005466), sleep apnea syndrome (MONDO:0005296)

## Full-text entities

- **Diseases:** cardiovascular and psychiatric diseases (MESH:D002318), sleep apnea syndrome (MESH:D012891), hypersomnia (MESH:D006970), sleep disorders (MESH:D012893), insomnia (MESH:D007319)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12561869/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12561869/full.md

## References

37 references — full list in the complete paper: https://tomesphere.com/paper/PMC12561869/full.md

---
Source: https://tomesphere.com/paper/PMC12561869