Towards objective and systematic evaluation of bias in artificial intelligence for medical imaging
Emma A.M. Stanley, Raissa Souza, Anthony Winder, Vedant Gulve,, Kimberly Amador, Matthias Wilms, Nils D. Forkert

TL;DR
This paper introduces a systematic framework using synthetic medical images to evaluate bias in AI models for medical imaging, enabling controlled analysis of bias effects and mitigation strategies.
Contribution
It presents a novel methodology for objectively assessing bias in medical imaging AI using synthetic data and counterfactual scenarios, facilitating bias mitigation research.
Findings
Revealed biases cause subgroup performance disparities in CNN classifiers.
Reweighing was the most effective bias mitigation strategy tested.
Explainable AI methods help investigate bias manifestation.
Abstract
Artificial intelligence (AI) models trained using medical images for clinical tasks often exhibit bias in the form of disparities in performance between subgroups. Since not all sources of biases in real-world medical imaging data are easily identifiable, it is challenging to comprehensively assess how those biases are encoded in models, and how capable bias mitigation methods are at ameliorating performance disparities. In this article, we introduce a novel analysis framework for systematically and objectively investigating the impact of biases in medical images on AI models. We developed and tested this framework for conducting controlled in silico trials to assess bias in medical imaging AI using a tool for generating synthetic magnetic resonance images with known disease effects and sources of bias. The feasibility is showcased by using three counterfactual bias scenarios to measure…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · Artificial Intelligence in Healthcare and Education · Machine Learning in Healthcare
