Bias in Machine Learning Models Can Be Significantly Mitigated by Careful Training: Evidence from Neuroimaging Studies
Rongguang Wang, Pratik Chaudhari, Christos Davatzikos

TL;DR
This study demonstrates that with careful training, machine learning models in neuroimaging can achieve unbiased, accurate predictions across diverse demographic groups and conditions, challenging assumptions about inherent bias.
Contribution
The paper provides evidence that properly trained models using multi-source data can generalize well and reduce bias in neuroimaging diagnostics for brain diseases.
Findings
Models trained with multi-source data are less biased across demographic groups.
Proper training can lead to high AUC and fairness in diverse subgroups.
Additional demographic and clinical features can improve model performance and fairness.
Abstract
Despite the great promise that machine learning has offered in many fields of medicine, it has also raised concerns about potential biases and poor generalization across genders, age distributions, races and ethnicities, hospitals, and data acquisition equipment and protocols. In the current study, and in the context of three brain diseases, we provide evidence which suggests that when properly trained, machine learning models can generalize well across diverse conditions and do not necessarily suffer from bias. Specifically, by using multi-study magnetic resonance imaging consortia for diagnosing Alzheimer's disease, schizophrenia, and autism spectrum disorder, we find that well-trained models have a high area-under-the-curve (AUC) on subjects across different subgroups pertaining to attributes such as gender, age, racial groups, and different clinical studies and are unbiased under…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
