Diagnostic Uncertainty Calibration: Towards Reliable Machine Predictions   in Medical Domain

Takahiro Mimori; Keiko Sasada; Hirotaka Matsui; Issei Sato

arXiv:2007.01659·stat.ML·March 23, 2021

Diagnostic Uncertainty Calibration: Towards Reliable Machine Predictions in Medical Domain

Takahiro Mimori, Keiko Sasada, Hirotaka Matsui, Issei Sato

PDF

Open Access

TL;DR

This paper introduces a new evaluation framework and a post-hoc calibration method to improve the reliability of probability estimates in medical diagnosis models, especially under label uncertainty.

Contribution

It formalizes metrics for label uncertainty evaluation and proposes alpha-calibration, a novel method for better uncertainty calibration in neural networks.

Findings

01

Alpha-calibration improves the reliability of uncertainty estimates.

02

The framework effectively assesses predictions with label disagreement.

03

Results show enhanced calibration in medical imaging applications.

Abstract

We propose an evaluation framework for class probability estimates (CPEs) in the presence of label uncertainty, which is commonly observed as diagnosis disagreement between experts in the medical domain. We also formalize evaluation metrics for higher-order statistics, including inter-rater disagreement, to assess predictions on label uncertainty. Moreover, we propose a novel post-hoc method called $a l p ha$ -calibration, that equips neural network classifiers with calibrated distributions over CPEs. Using synthetic experiments and a large-scale medical imaging application, we show that our approach significantly enhances the reliability of uncertainty estimates: disagreement probabilities and posterior CPEs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Anomaly Detection Techniques and Applications · Explainable Artificial Intelligence (XAI)