A systematic evaluation of uncertainty quantification techniques in deep learning: a case study in photoplethysmography signal analysis
Ciaran Bench, Oskar Pfeffer, Vivek Desai, Mohammad Moulaeifard, Lo\"ic Coquelin, Peter H. Charlton, Nils Strodthoff, Nando Hegemann, Philip J. Aston, Andrew Thompson

TL;DR
This paper systematically compares eight uncertainty quantification techniques in deep learning models trained on photoplethysmography data for clinical prediction tasks, revealing that the optimal method depends on specific evaluation criteria and practical use cases.
Contribution
It introduces a comprehensive evaluation framework for uncertainty quantification methods in medical time-series analysis, highlighting the importance of local calibration and task-specific assessment.
Findings
Uncertainty technique effectiveness varies with evaluation metrics and reliability scales.
Local calibration and adaptivity provide valuable insights into model behavior.
Evaluation criteria should align with practical clinical use cases.
Abstract
In principle, deep learning models trained on medical time-series, including wearable photoplethysmography (PPG) sensor data, can provide a means to continuously monitor physiological parameters outside of clinical settings. However, there is considerable risk of poor performance when deployed in practical measurement scenarios leading to negative patient outcomes. Reliable uncertainties accompanying predictions can provide guidance to clinicians in their interpretation of the trustworthiness of model outputs. It is therefore of interest to compare the effectiveness of different approaches. Here we implement an unprecedented set of eight uncertainty quantification (UQ) techniques to models trained on two clinically relevant prediction tasks: Atrial Fibrillation (AF) detection (classification), and two variants of blood pressure regression. We formulate a comprehensive evaluation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNon-Invasive Vital Sign Monitoring · Healthcare Technology and Patient Monitoring · Heart Rate Variability and Autonomic Control
