An Investigation Into Race Bias in Random Forest Models Based on Breast DCE-MRI Derived Radiomics Features
Mohamed Huti, Tiarna Lee, Elinor Sawyer, Andrew P. King

TL;DR
This study reveals that radiomics features from breast DCE-MRI can encode race information and that random forest models trained on such data exhibit race bias, performing better on the race they are trained on.
Contribution
It demonstrates that classical radiomics-based AI models can encode race information and exhibit bias, highlighting the need for bias mitigation in traditional AI methods.
Findings
Radiomics features contain race-identifiable information.
RF models can predict race with 60-70% accuracy.
Race-imbalanced training leads to biased model performance.
Abstract
Recent research has shown that artificial intelligence (AI) models can exhibit bias in performance when trained using data that are imbalanced by protected attribute(s). Most work to date has focused on deep learning models, but classical AI techniques that make use of hand-crafted features may also be susceptible to such bias. In this paper we investigate the potential for race bias in random forest (RF) models trained using radiomics features. Our application is prediction of tumour molecular subtype from dynamic contrast enhanced magnetic resonance imaging (DCE-MRI) of breast cancer patients. Our results show that radiomics features derived from DCE-MRI data do contain race-identifiable information, and that RF models can be trained to predict White and Black race from these data with 60-70% accuracy, depending on the subset of features used. Furthermore, RF models trained to predict…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · MRI in cancer diagnosis · AI in cancer detection
