Diagnosis Uncertain Models For Medical Risk Prediction
Alexander Peysakhovich, Rich Caruana, Yin Aphinyanaphongs

TL;DR
This paper examines the limitations of all-cause patient risk models that lack diagnostic data, revealing their tendency to underestimate risks for rare diagnoses and proposing a method to incorporate diagnostic uncertainty for improved interpretability.
Contribution
It introduces a novel approach to model diagnostic uncertainty in risk prediction, enhancing interpretability and addressing underestimation issues in rare but risky diagnoses.
Findings
Models generalize well across diagnoses but have predictable failure modes.
Underestimation of risk occurs for rare, high-risk diagnoses due to averaging effects.
Explicit modeling of diagnostic uncertainty improves risk estimation and interpretability.
Abstract
We consider a patient risk models which has access to patient features such as vital signs, lab values, and prior history but does not have access to a patient's diagnosis. For example, this occurs in a model deployed at intake time for triage purposes. We show that such `all-cause' risk models have good generalization across diagnoses but have a predictable failure mode. When the same lab/vital/history profiles can result from diagnoses with different risk profiles (e.g. E.coli vs. MRSA) the risk estimate is a probability weighted average of these two profiles. This leads to an under-estimation of risk for rare but highly risky diagnoses. We propose a fix for this problem by explicitly modeling the uncertainty in risk prediction coming from uncertainty in patient diagnoses. This gives practitioners an interpretable way to understand patient risk beyond a single risk number.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Health Systems, Economic Evaluations, Quality of Life · Bayesian Modeling and Causal Inference
