Evaluating the Fairness of Deep Learning Uncertainty Estimates in Medical Image Analysis
Raghav Mehta, Changjian Shui, Tal Arbel

TL;DR
This paper investigates how fairness-enhancing methods in deep learning affect uncertainty estimates in medical image analysis tasks, highlighting a tradeoff between fairness and uncertainty accuracy.
Contribution
It is the first to analyze the impact of fairness models on both performance and uncertainty quantification in medical imaging deep learning models.
Findings
Fairness methods improve subgroup performance in some tasks.
Fairness methods can degrade the quality of uncertainty estimates.
Tradeoff exists between fairness and uncertainty accuracy.
Abstract
Although deep learning (DL) models have shown great success in many medical image analysis tasks, deployment of the resulting models into real clinical contexts requires: (1) that they exhibit robustness and fairness across different sub-populations, and (2) that the confidence in DL model predictions be accurately expressed in the form of uncertainties. Unfortunately, recent studies have indeed shown significant biases in DL models across demographic subgroups (e.g., race, sex, age) in the context of medical image analysis, indicating a lack of fairness in the models. Although several methods have been proposed in the ML literature to mitigate a lack of fairness in DL models, they focus entirely on the absolute performance between groups without considering their effect on uncertainty estimation. In this work, we present the first exploration of the effect of popular fairness models on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · AI in cancer detection · Radiomics and Machine Learning in Medical Imaging
