Label-Efficient Sleep Staging Using Transformers Pre-trained with   Position Prediction

Sayeri Lala; Hanlin Goh; Christopher Sandino

arXiv:2404.15308·eess.SP·April 25, 2024

Label-Efficient Sleep Staging Using Transformers Pre-trained with Position Prediction

Sayeri Lala, Hanlin Goh, Christopher Sandino

PDF

Open Access

TL;DR

This paper introduces a new self-supervised learning approach for sleep staging using transformers, which improves performance and reduces labeled data requirements by pretraining the entire model rather than just parts of it.

Contribution

It proposes a coupled architecture and pretraining scheme that pretrains the entire model, leading to better performance and less dependence on large labeled datasets.

Findings

01

Performance gains of 3-5% in sleep staging accuracy.

02

Reduces labeled data needs by approximately 800 subjects.

03

Performance does not saturate with more labeled data.

Abstract

Sleep staging is a clinically important task for diagnosing various sleep disorders, but remains challenging to deploy at scale because it because it is both labor-intensive and time-consuming. Supervised deep learning-based approaches can automate sleep staging but at the expense of large labeled datasets, which can be unfeasible to procure for various settings, e.g., uncommon sleep disorders. While self-supervised learning (SSL) can mitigate this need, recent studies on SSL for sleep staging have shown performance gains saturate after training with labeled data from only tens of subjects, hence are unable to match peak performance attained with larger datasets. We hypothesize that the rapid saturation stems from applying a sub-optimal pretraining scheme that pretrains only a portion of the architecture, i.e., the feature encoder, but not the temporal encoder; therefore, we propose…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing