# Analyzing the Role of Model Uncertainty for Electronic Health Records

**Authors:** Michael W. Dusenberry, Dustin Tran, Edward Choi, Jonas Kemp, Jeremy, Nixon, Ghassen Jerfel, Katherine Heller, Andrew M. Dai

arXiv: 1906.03842 · 2020-03-27

## TL;DR

This paper investigates the importance of model uncertainty in medical predictions, revealing that population metrics often miss patient-specific uncertainties and proposing Bayesian embeddings as an efficient alternative.

## Contribution

It demonstrates that Bayesian embeddings can effectively capture model uncertainty in medical RNNs, offering a more efficient approach than ensembles.

## Key findings

- Population-level metrics do not reflect model uncertainty.
- Significant variability exists in patient-specific predictions.
- Bayesian embeddings outperform ensembles in capturing uncertainty.

## Abstract

In medicine, both ethical and monetary costs of incorrect predictions can be significant, and the complexity of the problems often necessitates increasingly complex models. Recent work has shown that changing just the random seed is enough for otherwise well-tuned deep neural networks to vary in their individual predicted probabilities. In light of this, we investigate the role of model uncertainty methods in the medical domain. Using RNN ensembles and various Bayesian RNNs, we show that population-level metrics, such as AUC-PR, AUC-ROC, log-likelihood, and calibration error, do not capture model uncertainty. Meanwhile, the presence of significant variability in patient-specific predictions and optimal decisions motivates the need for capturing model uncertainty. Understanding the uncertainty for individual patients is an area with clear clinical impact, such as determining when a model decision is likely to be brittle. We further show that RNNs with only Bayesian embeddings can be a more efficient way to capture model uncertainty compared to ensembles, and we analyze how model uncertainty is impacted across individual input features and patient subgroups.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.03842/full.md

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/1906.03842/full.md

## References

36 references — full list in the complete paper: https://tomesphere.com/paper/1906.03842/full.md

---
Source: https://tomesphere.com/paper/1906.03842