Towards a Flexible Deep Learning Method for Automatic Detection of Clinically Relevant Multi-Modal Events in the Polysomnogram
Alexander Neergaard Olesen, Stanislas Chambon, Valentin Thorey, Poul, Jennum, Emmanuel Mignot, Helge B. D. Sorensen

TL;DR
This paper introduces a deep learning model for automatic detection of sleep-related events like arousals and leg movements, demonstrating promising results and potential for broader sleep analysis applications.
Contribution
A novel deep learning approach incorporating RNNs for detecting sleep events, with optimized configurations for arousal and leg movement detection.
Findings
Arousal detection achieved F1 score of 0.75 with RNN and dynamic window.
Leg movement detection achieved F1 score of 0.65 with static window.
Model shows potential for general sleep analysis, with room for improvement.
Abstract
Much attention has been given to automatic sleep staging algorithms in past years, but the detection of discrete events in sleep studies is also crucial for precise characterization of sleep patterns and possible diagnosis of sleep disorders. We propose here a deep learning model for automatic detection and annotation of arousals and leg movements. Both of these are commonly seen during normal sleep, while an excessive amount of either is linked to disrupted sleep patterns, excessive daytime sleepiness impacting quality of life, and various sleep disorders. Our model was trained on 1,485 subjects and tested on 1,000 separate recordings of sleep. We tested two different experimental setups and found optimal arousal detection was attained by including a recurrent neural network module in our default model with a dynamic default event window (F1 = 0.75), while optimal leg movement…
| train | eval | test | p-value | |
|---|---|---|---|---|
| N | 1,485 | 165 | 1000 | - |
| Age (years) | 0.631 | |||
| BMI () | 0.879 | |||
| AHI () | 0.029 | |||
| AI () | 0.607 | |||
| PLMI () | 0.204 |
| Module | Input dim. | Output dim. | Type | Kernel size | No. kernels | Stride | Activation | |
| 1D convolution | 1 | linear | ||||||
| 1D convolution | 8 | – | ||||||
| Batch norm. | – | 8 | – | ReLU | ||||
| 1D max. pool. | – | – | ||||||
| 1D convolution | – | |||||||
| Batch norm. | – | – | ReLU | |||||
| 1D max. pool. | – | – | ||||||
| bGRU | – | – | – | |||||
| 1D convolution | softmax for each kernel | |||||||
| 1D convolution | linear |
| Model | F1 | Pr | Re |
|---|---|---|---|
| LM, static | |||
| AR, static | |||
| LM, dynamic | |||
| AR, dynamic. | |||
| LM, RNN | |||
| AR, RNN |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Towards a Flexible Deep Learning Method for Automatic Detection of Clinically Relevant Multi-Modal Events in the Polysomnogram
Alexander Neergaard Olesen*†,1,2,3*, Member, IEEE, Stanislas Chambon4,5, Valentin Thorey5,
Poul Jennum3, Emmanuel Mignot2 and Helge B. D. Sorensen3, Senior Member, IEEE Research supported by the Klarman Family Foundation, Technical University of Denmark, and University of Copenhagen with supporting grants from Reinholdt W. Jorck & Wife’s Foundation, Knud Højgaard Foundation, Otto Mønsted Foundation, Vera & Carl Michaelsens Foundation, Augustinus Foundation, and Stibo Foundation.†Corresponding author: alexno@stanford / [email protected]1Department of Health Technology, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark.2Center for Sleep Sciences and Medicine, Stanford University, Palo Alto, CA 94304, USA.3Danish Center for Sleep Medicine, University Hospital Copenhagen, 2600 Glostrup, Denmark4LTCI Telecom ParisTech, Universite Paris-Saclay, Paris, France.5Research & Algorithms Team, Dreem, Paris, France.
Abstract
Much attention has been given to automatic sleep staging algorithms in past years, but the detection of discrete events in sleep studies is also crucial for precise characterization of sleep patterns and possible diagnosis of sleep disorders. We propose here a deep learning model for automatic detection and annotation of arousals and leg movements. Both of these are commonly seen during normal sleep, while an excessive amount of either is linked to disrupted sleep patterns, excessive daytime sleepiness impacting quality of life, and various sleep disorders. Our model was trained on 1,485 subjects and tested on 1,000 separate recordings of sleep. We tested two different experimental setups and found optimal arousal detection was attained by including a recurrent neural network module in our default model with a dynamic default event window (F1 = 0.75), while optimal leg movement detection was attained using a static event window (F1 = 0.65). Our work show promise while still allowing for improvements. Specifically, future research will explore the proposed model as a general-purpose sleep analysis model.
I INTRODUCTION
Analysis of sleep patterns is performed manually by experts in sleep clinics using rules and guidelines defined by the American Academy of Sleep Medicine recently updated in 2018 [1]. These guidelines outline technical and clinical best practices when performing routine polysomnography (PSG), which is an overnight recording of electroencephalography (EEG), electrooculography (EOG), electromyography (EMG) electrocardiography (ECG), respiratory effort and peripheral limb activity. Expert technicians and somnologists use these physiological variables to analyse sleep patterns and diagnose sleep disorders based on key metrics and indices, such as total sleep time, amount of sleep spent in various sleep stages, and the observed number of discrete events per hour of sleep. Specifically, the number of arousals (short awakenings during sleep, 15 s), non-periodic and periodic leg movements (PLM), and the number of apnea events per hour of sleep are summarized in the arousal index (AI), periodic leg movements index (PLMI) and apnea/hypopnea index (AHI), the latter of which is a combination of apneic (no/obstructed respiratory effort) and hypopneic (reduced respiratory effort) events. Excessive amounts of these events are disruptive to normal sleep, which can lead to patient complaints of excessive daytime sleepiness [2], which in turn is linked to an increase in e.g. automotive accidents and reduced quality of life [3]. Increased number of PLMs is also linked to other sleep disorders such as restless legs syndrome, and periodic leg movement disorder [4, 5].
Correct diagnosis of sleep disorders is predicated on precise scoring of sleep stages as well as accurate scoring of these discrete sleep events. However, the current gold standard of manual analysis by experienced technicians is inherently biased and inconsistent.Several studies have shown low inter-rater reliability on both the scoring of sleep stages [6, 7, 8], arousals [9], and respiratory events [10]. Furthermore, manual analysis of PSGs is time-consuming and prone to scorer fatigue. Thus, there is a need for efficient systems that provide deterministic and reliable scorings of sleep studies.
Several recent studies have already explored automatic classification of sleep stages in large cohorts with good results [11, 12, 13, 14, 15], however, the reliable and consistent detection and classification of discrete PSG events in large cohorts remain largely unexplored.
Recent studies on certain microevents in sleep have indicated that sleep spindles and K-complexes can be reliably detected and annotated with start time and duration using deep learning methods [16, 17]. Specifically, these studies proposed a single-shot event detection algorithm, that parallels the YOLO and SSD algorithms used for object detection in 2D images [18, 19], however, they were limited in scope by detecting events only at the EEG level, and did not explicitly take advantage of the temporal connection of the detected events. Additionally, experiments were carried out on a small-scale database [16].
In this study, we focused on the detection of arousals (AR) and leg movements (LM). These events arise from highly distinct physiological sources, EEG and leg EMG, while ARs are also visible in the EOG and chin EMG. These events are important for the precise characterization of sleep patterns and possible diagnosis of sleep disorders, and an accurate detection is therefore of high interest. We extend previous work in [16, 17] by 1) preprocessing and analysing multiple input signals at the same time, and 2) taking into account important temporal context using recurrent neural networks. Furthermore, we apply our model on a larger database than previous studies.
II DATA
II-A MrOS Sleep Study
The MrOS Sleep Study is a part of the larger Osteoporotic Fractures in Men Study with the objective of researching the links between sleep disorders, fractures, cardiovascular disease and mortality in older males ( years) [20, 21, 22]. Between 2003 and 2005, 3,135 of the original 5,994 participants were recruited to undergo full-night PSG recording at six centers in the US at two separate visits (visit 1 and visit 2) with following 3 to 5-day actigraphy studies at home. The resulting PSG studies were subsequently scored by experienced sleep technicians for standard sleep variables including sleep stages, leg movements, arousals, and respiratory events.
II-B Included events and signals
In this study, we only considered the detection of two PSG events, arousals and leg movements. These events are characterized by a start time and a duration, which we extracted from 2,907 PSG studies from visit 1 available from the National Sleep Research Resource repository [23, 24]. From each PSG study, we extracted left and right central EEG, left and right EOG, chin EMG, and EMG from the left and right anterior tibialis. EEG and EOG channels were referenced to the contralateral mastoid process, while a leg EMG channel was synthesized by referencing left to right. Any PSG without the full set of channels or without any event scoring was eliminated from further analysis.
II-C Subset demographics and partitioning
In total, 2,650 out of the 2,907 PSGs available from visit 1 were included in this study. These were partitioned into train, eval, and test sets of sizes 1,485, 165, and 1,000 studies, respectively. A subset of key demographic and PSG variables are presented in Table I.
III METHODS
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] R. B. Berry, C. L. Albertario, S. M. Harding, R. M. Lloyd, D. T. Plante, S. F. Quan, M. M. Troester, and B. V. Vaughn, The AASM Manual for the Scoring of Sleep and Associated Events: Rules, Terminology and Technical Specifications. . Version 2.5. Darien, Il: American Academy of Sleep Medicine, 2018.
- 2[2] P. Halász, M. Terzano, L. Parrino, and R. Bódizs, “The nature of arousal in sleep,” J. Sleep Res. , vol. 13, no. 1, pp. 1–23, 2004.
- 3[3] L. J. Findley, M. E. Unverzagt, and P. M. Suratt, “Automobile Accidents Involving Patients with Obstructive Sleep Apnea,” Am. Rev. Respir. Dis. , vol. 138, no. 2, pp. 337–340, 1988.
- 4[4] R. Ferri, B. B. Koo, D. L. Picchietti, and S. Fulda, “Periodic leg movements during sleep: phenotype, neurophysiology, and clinical significance,” Sleep Med. , vol. 31, pp. 29–38, 2017.
- 5[5] American Academy of Sleep Medicine, International classification of sleep disorders , 3rd ed. Darien, Il: American Academy of Sleep Medicine, 2014.
- 6[6] R. G. Norman, I. Pal, C. Stewart, J. A. Walsleben, and D. M. Rapoport, “Interobserver agreement among sleep scorers from different centers in a large dataset.” Sleep , vol. 23, no. 7, pp. 901–8, 2000.
- 7[7] R. S. Rosenberg and S. Van Hout, “The American Academy of Sleep Medicine Inter-scorer Reliability Program: Sleep Stage Scoring,” J. Clin. Sleep Med. , vol. 9, no. 1, pp. 81–87, 2013.
- 8[8] M. Younes, J. Raneri, and P. Hanly, “Staging sleep in polysomnograms: Analysis of inter-scorer variability,” J. Clin. Sleep Med. , vol. 12, no. 6, pp. 885–894, 2016.
