sDREAMER: Self-distilled Mixture-of-Modality-Experts Transformer for   Automatic Sleep Staging

Jingyuan Chen; Yuan Yao; Mie Anderson; Natalie Hauglund; Celia; Kjaerby; Verena Untiet; Maiken Nedergaard; Jiebo Luo

arXiv:2501.16329·cs.LG·January 28, 2025

sDREAMER: Self-distilled Mixture-of-Modality-Experts Transformer for Automatic Sleep Staging

Jingyuan Chen, Yuan Yao, Mie Anderson, Natalie Hauglund, Celia, Kjaerby, Verena Untiet, Maiken Nedergaard, Jiebo Luo

PDF

TL;DR

sDREAMER is a novel transformer-based sleep staging model that enhances cross-modality interaction and can handle various input sources, outperforming existing methods in accuracy for both single and multi-channel EEG and EMG signals.

Contribution

The paper introduces sDREAMER, a self-distilled mixture-of-modality-experts transformer that improves sleep stage classification by better integrating multi-modal signals and supporting flexible input configurations.

Findings

01

Outperforms existing transformer-based sleep scoring methods in multi-channel inference.

02

Achieves superior accuracy in single-channel sleep staging compared to prior models.

03

Demonstrates effective cross-modality information interaction through self-distillation.

Abstract

Automatic sleep staging based on electroencephalography (EEG) and electromyography (EMG) signals is an important aspect of sleep-related research. Current sleep staging methods suffer from two major drawbacks. First, there are limited information interactions between modalities in the existing methods. Second, current methods do not develop unified models that can handle different sources of input. To address these issues, we propose a novel sleep stage scoring model sDREAMER, which emphasizes cross-modality interaction and per-channel performance. Specifically, we develop a mixture-of-modality-expert (MoME) model with three pathways for EEG, EMG, and mixed signals with partially shared weights. We further propose a self-distillation training scheme for further information interaction across modalities. Our model is trained with multi-channel inputs and can make classifications on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.