Demographic Bias of Expert-Level Vision-Language Foundation Models in Medical Imaging
Yuzhe Yang, Yujia Liu, Xin Liu, Avanti Gulhane, Domenico Mastrodicasa,, Wei Wu, Edward J Wang, Dushyant W Sahani, Shwetak Patel

TL;DR
This paper investigates demographic biases in state-of-the-art vision-language models for medical imaging, revealing underdiagnosis of marginalized groups and highlighting ethical concerns for equitable healthcare.
Contribution
It provides a comprehensive analysis of demographic biases in foundation models for chest X-ray diagnosis across multiple datasets, emphasizing their potential to worsen healthcare disparities.
Findings
Models underdiagnose marginalized groups compared to radiologists.
Biases are consistent across various pathologies and demographics.
Model embeddings encode significant demographic information.
Abstract
Advances in artificial intelligence (AI) have achieved expert-level performance in medical imaging applications. Notably, self-supervised vision-language foundation models can detect a broad spectrum of pathologies without relying on explicit training annotations. However, it is crucial to ensure that these AI models do not mirror or amplify human biases, thereby disadvantaging historically marginalized groups such as females or Black patients. The manifestation of such biases could systematically delay essential medical care for certain patient subgroups. In this study, we investigate the algorithmic fairness of state-of-the-art vision-language foundation models in chest X-ray diagnosis across five globally-sourced datasets. Our findings reveal that compared to board-certified radiologists, these foundation models consistently underdiagnose marginalized groups, with even higher rates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education
