Embedded Deep Learning for Sleep Staging
Engin T\"uretken, J\'er\^ome Van Zaen, Ricard Delgado-Gonzalo

TL;DR
This paper explores the application of embedded deep learning architectures for sleep staging in resource-constrained wearable devices, aiming to improve sleep analysis in consumer healthcare.
Contribution
It introduces two deep learning models optimized for low-power wearables and compares their performance with traditional hand-crafted algorithms.
Findings
Deep learning models can be adapted for resource-limited sleep monitoring devices.
Embedded DL architectures show promising accuracy for sleep staging.
Comparison indicates potential advantages over traditional algorithms.
Abstract
The rapidly-advancing technology of deep learning (DL) into the world of the Internet of Things (IoT) has not fully entered in the fields of m-Health yet. Among the main reasons are the high computational demands of DL algorithms and the inherent resource-limitation of wearable devices. In this paper, we present initial results for two deep learning architectures used to diagnose and analyze sleep patterns, and we compare them with a previously presented hand-crafted algorithm. The algorithms are designed to be reliable for consumer healthcare applications and to be integrated into low-power wearables with limited computational resources.
| Layer type | Stride | Filter shape | Input size |
|---|---|---|---|
| Conv | 1 | 3 3 32 | 2048 3 |
| Max Pool | 1 | 2 1 | 2048 32 |
| Conv | 2 | 3 32 32 | 1024 32 |
| Conv | 2 | 3 32 48 | 512 32 |
| Conv | 2 | 3 48 64 | 256 48 |
| Conv | 2 | 3 64 64 | 128 64 |
| Conv | 2 | 3 64 64 | 64 64 |
| Conv | 2 | 3 64 64 | 32 64 |
| Conv | 2 | 3 64 64 | 16 64 |
| Conv | 2 | 3 64 64 | 8 64 |
| Conv | 1 | 3 64 64 | 4 64 |
| Conv | 1 | 1 64 64 | 4 64 |
| FC | 1 | 256 256 | 1 256 |
| FC | 1 | 256 21 | 1 256 |
| Algorithm | Accuracy | MACs | Parameters |
|---|---|---|---|
| Hand-designed (baseline) | 47% | — | — |
| ML-RNN | 68% | 1.2M | 1.2M |
| CNN | 81% | 6.2M | 166K |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Embedded Deep Learning for Sleep Staging
Engin Türetken
Embedded Vision System Group, *CSEM
*Neuchâtel, Switzerland
Jérôme Van Zaen
Signal Processing Group, *CSEM
*Neuchâtel, Switzerland
Ricard Delgado-Gonzalo
Embedded Software Group, *CSEM
*Neuchâtel, Switzerland
Abstract
The rapidly-advancing technology of deep learning (DL) into the world of the Internet of Things (IoT) has not fully entered in the fields of m-Health yet. Among the main reasons are the high computational demands of DL algorithms and the inherent resource-limitation of wearable devices. In this paper, we present initial results for two deep learning architectures used to diagnose and analyze sleep patterns, and we compare them with a previously presented hand-crafted algorithm. The algorithms are designed to be reliable for consumer healthcare applications and to be integrated into low-power wearables with limited computational resources.
Index Terms:
CNN, RNN, deep learning, embedded, SoC, sleep, polysomnography, e-health, m-health
I Introduction
Deep learning (DL) is a branch of machine learning based on artificial neural networks that hierarchically model high-level abstractions in data [1]. Thanks to the recent availability of large-scale labeled datasets and powerful hardware to process them, the field has been successful in many fields including computer vision [2], natural language processing [3], and speech analysis [4]. The technology is likely to be disruptive in many application areas and expected to render conventional machine learning techniques obsolete. Consequently, substantial research is being performed to adopt it in the rapidly-growing wearables and IoT markets [5, 6]. According to CCS Insight, the wearable market is expected to grow from over 17 billion by 2021.
In this race of building DL-powered wearables, multiple efforts have been made to streamline and simplify current DL frameworks to enable them to be used in the edge. Since wearable devices are at the extreme edge, they need to minimize their computational footprint in order to maximize battery life time for a proper life-style assessment, only the most low-level frameworks, such as CMSIS-NN [7], are simple enough to be considered given the current state of System-On-Chip (SoC) technology.
This paper presents a proof of concept (PoC) using deep learning architectures for sleep staging and evaluates two implementations fully embeddable in low-power SoC. We compare the performance of these architectures with a hand-crafted algorithm designed for low-power variables.
II Data
The data for the PoC was taken from the PhysioNet/Computing in Cardiology Challenge 2018 (CinC18)111https://www.physionet.org/challenge/2018/. The dataset was contributed by the Massachusetts General Hospital’s (MGH) Computational Clinical Neurophysiology Laboratory, and the Clinical Data Animation Laboratory. The whole dataset includes 1985 subjects who were monitored at an MGH sleep laboratory for the diagnosis of sleep disorders. However, only the training data (994 subjects) was used in this research since it was the only part that was open to the public. The sleep stages of the subjects were annotated by the MGH according to the American Academy of Sleep Medicine [8]. Specifically, every 30-second segment was annotated with one of the following labels: wakefulness, rapid eye movement (REM), NREM stage 1, NREM stage 2, and NREM stage 3, where NREM stands for non-REM. For our analysis, we merged NREM stage 1, NREM stage 2, and NREM stage 3. The dataset contained the following physiological signals at a high sampling rate: electroencephalography (EEG), electrooculography (EOG), electromyography (EMG), electrocardiology (ECG), and oxygen saturation (SaO2). We split the dataset into training and test subsets with 90% for training and 10% for testing. It is worth noting that this dataset contains a high number of pathological cases since it is a very specific cohort. Therefore, it is not representative of the general population but constitutes a challenging medical-grade dataset.
III Methods
III-A Hand-designed
In order to establish a baseline, we used our previously-descried algorithm based on physiological cardiorespiratory cues [9] extracted from photoplethysmographic sensors [10]. For a healthy population, this algorithm proved to achieve a sensitivity and specificity for REM of 89.2% and 77.9% respectively; and for NREM 83.4% and 84.9% respectively. The algorithm was exclusively based on the analysis of the heart rate variability (HRV) and movement (Fig. 1).
III-B Deep learning
The input to the algorithms is a short temporal sequence (around 8.5 minutes) of HRV values at 4Hz and a binary value that denotes whether the subject moved. We developed several deep learning architectures for sleep staging and evaluated two embedded implementations:
- •
Multi-Layer Recurrent Neural Networks (ML-RNNs): RNNs are powerful architectures designed to model long-term temporal relations in the data. They have been shown to work very well in natural language processing.
- •
Convolutional Neural Networks (CNNs): CNNs are neural networks which can capture local patterns with increasing semantic complexity in spatiotemporal data.
In our implementation, the CNN and the ML-RNN architectures employ fully connected and normalization layers, and the latter is based on an efficient version of LSTM with forget gates. More specifically, the ML-RNN architecture is comprised of two LSTM layers respectively with 128 and 32 hidden units, which are followed by two fully connected layers with bias. The CNN architecture contains eleven convolutional layers with a temporal size of 3, a single max-pooling layer, and two fully connected layers (see Table I). The convolutional layers are followed by batch normalization and ReLU units.
IV Results
The RNN-based network brings approximately 20% improvement in mean accuracy over the baseline method and the CNN approximately 35% (see Table II). Higher accuracy obtained by the CNN network suggests that high frequency patterns in the data, which RNNs are ineffective at modeling, is informative for the sleep staging task. As shown in Table II, both architectures are small in size and require limited processing resources to run. Their particular execution time will depend on the SoC that is selected.
V Conclusion
We presented results for two prominent DL architectures used to diagnose and analyze sleep patterns on the CinC2018 dataset, and compared them to a hand-crafted algorithm. The DL architectures outperform by more than 20% the hard-designed baseline which was developed on a database on healthy subjects. Inter-observer agreement of expert sleep scorers on more reliable and multi-modal sensory data such as EEG and EOG is less than 85% [11, 12]. Considering that we used only HRV as input and the pathological nature of the dataset used to train the efficient CNN model, an average accuracy of 81% on 3-class sleep staging is very promising. Future work includes embedding the networks on a resource-limited processor from the ARM Cortex-M family or a low-power neural network hardware accelerator, such as the GAP8.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Y. Le Cun et al. , “Deep learning,” Nature , vol. 521, no. 7553, pp. 436–444, 2015.
- 2[2] K. He et al. , “Deep residual learning for image recognition,” in CVPR’16 , 2016, pp. 770–778.
- 3[3] R. Socher et al. , “Deep learning for NLP (without magic),” in Tutorial Abstracts of ACL 2012 , 2012, p. 5.
- 4[4] L. Deng et al. , “New types of deep neural network learning for speech recognition and related applications: An overview,” in ICASSP’13 , 2013, pp. 8599–8603.
- 5[5] R. Miotto et al. , “Deep learning for healthcare: Review, opportunities and challenges,” Briefings in bioinformatics , vol. 19, no. 6, pp. 1236–1246, 2017.
- 6[6] J. Van Zaen et al. , “Classification of cardiac arrhythmias from single lead ECG with a convolutional recurrent neural network,” in BIOSIGNALS’19 , 2019.
- 7[7] L. Lai et al. , “CMSIS-NN: Efficient neural network kernels for Arm Cortex-M CP Us,” ar Xiv preprint ar Xiv:1801.06601 , 2018.
- 8[8] R. B. Berry et al. , “The AASM manual for the scoring of sleep and associated events,” Rules, Terminology and Technical Specifications, Darien, Illinois, American Academy of Sleep Medicine , 2012.
