An Empirical Study on the Fairness of Foundation Models for Multi-Organ Image Segmentation
Qin Li, Yizhe Zhang, Yan Li, Jun Lyu, Meng Liu, Longyu Sun, Mengting, Sun, Qirong Li, Wenyue Mao, Xinran Wu, Yajing Zhang, Yinghua Chu, Shuo Wang, and Chengyan Wang

TL;DR
This study evaluates the fairness of large medical image segmentation models across demographic groups, revealing significant biases and disparities in performance and error distribution, highlighting the need for fairness considerations in model development.
Contribution
It introduces a curated benchmark dataset with demographic details and provides the first comprehensive fairness analysis of foundation models in multi-organ medical image segmentation.
Findings
Significant performance disparities across demographic groups.
Variations in spatial segmentation errors among different populations.
Empirical evidence of fairness concerns in foundation models.
Abstract
The segmentation foundation model, e.g., Segment Anything Model (SAM), has attracted increasing interest in the medical image community. Early pioneering studies primarily concentrated on assessing and improving SAM's performance from the perspectives of overall accuracy and efficiency, yet little attention was given to the fairness considerations. This oversight raises questions about the potential for performance biases that could mirror those found in task-specific deep learning models like nnU-Net. In this paper, we explored the fairness dilemma concerning large segmentation foundation models. We prospectively curate a benchmark dataset of 3D MRI and CT scans of the organs including liver, kidney, spleen, lung and aorta from a total of 1056 healthy subjects with expert segmentations. Crucially, we document demographic details such as gender, age, and body mass index (BMI) for each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Artificial Intelligence in Healthcare and Education · AI in cancer detection
MethodsSoftmax · Attention Is All You Need · Segment Anything Model
