Uncertainty Reliability Under Domain Shift: An Investigation for Data-Driven Blood Pressure Estimation in Photoplethysmography
Mohammad Moulaeifard, Ciaran Bench, Philip J. Aston, Nils Strodthoff

TL;DR
This study evaluates the reliability of uncertainty quantification methods for deep learning-based blood pressure estimation from PPG signals under domain shift, emphasizing the importance of calibration and robustness.
Contribution
It compares various uncertainty quantification techniques and calibration methods, identifying the most robust and well-calibrated approaches for out-of-distribution blood pressure prediction.
Findings
Deep ensembles outperform Monte Carlo dropout under domain shift.
Recalibrated Gaussian negative log-likelihood yields the best uncertainty calibration.
Conformal prediction and temperature scaling provide consistent improvements.
Abstract
Uncertainty quantification (UQ) is critical for safety-critical domains like healthcare, yet it is rarely evaluated under realistic out-of-distribution (OOD) conditions. Here, we assessed predictive performance and uncertainty reliability for deep learning-based blood pressure (BP) estimation from photoplethysmography (PPG) signals under both in-distribution (ID) and OOD settings. Using an XResNet1D-50 trained on PulseDB and tested on four external datasets, we compared deep ensembles (DE) and Monte Carlo dropout (MCD) with Gaussian negative log-likelihood (GNLL) and mean squared error (MSE) losses, optionally followed by post-hoc recalibration via conformal prediction (CP), temperature scaling (TS), and isotonic regression (IR). The key findings of our study are as follows: (1) DE provides stronger predictive robustness under domain shift than MCD, an advantage that becomes clear…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
