Uncovering Modality Discrepancy and Generalization Illusion for General-Purpose 3D Medical Segmentation
Yichi Zhang, Feiyang Xiao, Le Xue, Wenbo Zhang, Gang Feng, Chenguang Zheng, Yuan Qi, Yuan Cheng, Zixin Hu

TL;DR
This paper introduces a new dataset and comprehensive evaluation of 3D medical segmentation models, revealing significant modality discrepancies and highlighting the need for multi-modal training to achieve truly general-purpose medical AI tools.
Contribution
The paper provides the UMD dataset and a rigorous evaluation framework to assess model robustness across different imaging modalities, exposing limitations of current models in real-world scenarios.
Findings
Significant performance gap between benchmark results and real-world modality transitions.
Current models struggle with functional imaging modalities like PET/MRI.
Highlighting the necessity for multi-modal training to improve generalization.
Abstract
While emerging 3D medical foundation models are envisioned as versatile tools with offer general-purpose capabilities, their validation remains largely confined to regional and structural imaging, leaving a significant modality discrepancy unexplored. To provide a rigorous and objective assessment, we curate the UMD dataset comprising 490 whole-body PET/CT and 464 whole-body PET/MRI scans (675k 2D images, 12k 3D organ annotations) and conduct a thorough and comprehensive evaluation of representative 3D segmentation foundation models. Through intra-subject controlled comparisons of paired scans, we isolate imaging modality as the primary independent variable to evaluate model robustness in real-world applications. Our evaluation reveals a stark discrepancy between literature-reported benchmarks and real-world efficacy, particularly when transitioning from structural to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Medical Image Segmentation Techniques · Medical Imaging and Analysis
