SLEEPYLAND: trust begins with fair evaluation of automatic sleep staging models
Alvise Dei Rossi, Matteo Metaldi, Michal Bechny, Irina Filchenko, Julia van der Meer, Markus H. Schmidt, Claudio L.A. Bassetti, Athina Tzovara, Francesca D. Faraci, Luigi Fiorillo

TL;DR
SLEEPYLAND introduces a comprehensive evaluation framework and ensemble models for sleep staging, demonstrating improved robustness, generalization, and alignment with human scorers across diverse datasets and conditions.
Contribution
The paper presents SLEEPYLAND, an open-source framework with pre-trained models and ensemble methods that enhance fair evaluation and generalization of sleep staging models.
Findings
SOMNUS ensemble outperforms individual models and previous state-of-the-art methods.
SOMNUS exceeds human scorer performance on out-of-distribution datasets.
Ensemble disagreement metrics predict scorer disagreement with high accuracy.
Abstract
Despite advances in deep learning for automatic sleep staging, clinical adoption remains limited due to challenges in fair model evaluation, generalization across diverse datasets, model bias, and variability in human annotations. We present SLEEPYLAND, an open-source sleep staging evaluation framework designed to address these barriers. It includes more than 220'000 hours in-domain (ID) sleep recordings, and more than 84'000 hours out-of-domain (OOD) sleep recordings, spanning a broad range of ages, sleep-wake disorders, and hardware setups. We release pre-trained models based on high-performing SoA architectures and evaluate them under standardized conditions across single- and multi-channel EEG/EOG configurations. We introduce SOMNUS, an ensemble combining models across architectures and channel setups via soft voting. SOMNUS achieves robust performance across twenty-four different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEEG and Brain-Computer Interfaces · Obstructive Sleep Apnea Research · Sleep and related disorders
