ALPEC: A Comprehensive Evaluation Framework and Dataset for Machine Learning-Based Arousal Detection in Clinical Practice
Stefan Kraft, Andreas Theissler, Vera Wienhausen-Wilke, Philipp, Walter, Gjergji Kasneci

TL;DR
This paper introduces ALPEC, a new evaluation framework and dataset for sleep arousal detection using machine learning, emphasizing clinical relevance by focusing on arousal onsets and addressing annotation mismatches.
Contribution
It presents a novel post-processing and evaluation framework tailored for clinical needs and releases a comprehensive multimodal dataset for arousal onset detection.
Findings
Focusing on arousal onsets improves clinical applicability.
The new dataset reflects clinical annotation constraints and includes diverse modalities.
The framework enhances the integration of ML-based arousal detection in clinical practice.
Abstract
Detecting arousals in sleep is essential for diagnosing sleep disorders. However, using Machine Learning (ML) in clinical practice is impeded by fundamental issues, primarily due to mismatches between clinical protocols and ML methods. Clinicians typically annotate only the onset of arousals, while ML methods rely on annotations for both the beginning and end. Additionally, there is no standardized evaluation methodology tailored to clinical needs for arousal detection models. This work addresses these issues by introducing a novel post-processing and evaluation framework emphasizing approximate localization and precise event count (ALPEC) of arousals. We recommend that ML practitioners focus on detecting arousal onsets, aligning with clinical practice. We examine the impact of this shift on current training and evaluation schemes, addressing simplifications and challenges. We utilize a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition
