Inadequacy of common stochastic neural networks for reliable clinical decision support
Adrian Lindenmeyer, Malte Blattmann, Stefan Franke, Thomas Neumuth,, Daniel Schneider

TL;DR
This study evaluates the reliability of common stochastic neural networks in clinical decision support, revealing their tendency to underestimate uncertainty and overconfidently extrapolate beyond evidence, which limits their safety and trustworthiness.
Contribution
The paper critically assesses stochastic neural networks in healthcare, demonstrating their inadequacy in recognizing out-of-distribution samples and highlighting the need for more reliable uncertainty estimation methods.
Findings
State-of-the-art performance in mortality prediction with EHR data.
Critical underestimation of epistemic uncertainty by current stochastic methods.
Inadequacy of common stochastic approaches for reliable clinical decision support.
Abstract
Widespread adoption of AI for medical decision making is still hindered due to ethical and safety-related concerns. For AI-based decision support systems in healthcare settings it is paramount to be reliable and trustworthy. Common deep learning approaches, however, have the tendency towards overconfidence under data shift. Such inappropriate extrapolation beyond evidence-based scenarios may have dire consequences. This highlights the importance of reliable estimation of local uncertainty and its communication to the end user. While stochastic neural networks have been heralded as a potential solution to these issues, this study investigates their actual reliability in clinical applications. We centered our analysis on the exemplary use case of mortality prediction for ICU hospitalizations using EHR from MIMIC3 study. For predictions on the EHR time series, Encoder-Only Transformer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Insurance, Mortality, Demography, Risk Management · Forecasting Techniques and Applications
MethodsAttention Is All You Need · Residual Connection · Dropout · Byte Pair Encoding · Adam · Label Smoothing · Linear Layer · Multi-Head Attention · Softmax · Dense Connections
