Fairness in Cardiac MR Image Analysis: An Investigation of Bias Due to Data Imbalance in Deep Learning Based Segmentation
Esther Puyol-Anton, Bram Ruijsink, Stefan K. Piechnik, Stefan, Neubauer, Steffen E. Petersen, Reza Razavi, and Andrew P. King

TL;DR
This paper investigates racial and gender bias in deep learning-based cardiac MR segmentation caused by data imbalance, and evaluates strategies to improve fairness across demographic groups.
Contribution
It introduces and compares three bias mitigation strategies for fairness in cardiac MR segmentation models trained on imbalanced datasets.
Findings
Significant racial bias observed in model performance.
All proposed strategies improved fairness metrics.
Protected group models yielded the best fairness results.
Abstract
The subject of "fairness" in artificial intelligence (AI) refers to assessing AI algorithms for potential bias based on demographic characteristics such as race and gender, and the development of algorithms to address this bias. Most applications to date have been in computer vision, although some work in healthcare has started to emerge. The use of deep learning (DL) in cardiac MR segmentation has led to impressive results in recent years, and such techniques are starting to be translated into clinical practice. However, no work has yet investigated the fairness of such models. In this work, we perform such an analysis for racial/gender groups, focusing on the problem of training data imbalance, using a nnU-Net model trained and evaluated on cine short axis cardiac MR data from the UK Biobank dataset, consisting of 5,903 subjects from 6 different racial groups. We find statistically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
