Clinical Interpretability of Deep Learning Segmentation Through Shapley-Derived Agreement and Uncertainty Metrics

Tianyi Ren; Daniel Low; Pittra Jaengprajak; Juampablo Heras Rivera; Jacob Ruzevick; Mehmet Kurt

arXiv:2512.07224·eess.IV·December 9, 2025

Clinical Interpretability of Deep Learning Segmentation Through Shapley-Derived Agreement and Uncertainty Metrics

Tianyi Ren, Daniel Low, Pittra Jaengprajak, Juampablo Heras Rivera, Jacob Ruzevick, Mehmet Kurt

PDF

Open Access

TL;DR

This paper introduces Shapley-derived agreement and uncertainty metrics to improve the clinical interpretability of deep learning segmentation models in medical imaging, aiding clinicians in understanding model reliability.

Contribution

It proposes a novel use of Shapley values for assessing feature importance and model agreement in medical image segmentation, with clinically interpretable reliability metrics.

Findings

01

Higher model performance correlates with greater agreement with clinical rankings.

02

Shapley ranking variance is negatively correlated with segmentation accuracy.

03

Metrics effectively reflect model reliability and interpretability.

Abstract

Segmentation is the identification of anatomical regions of interest, such as organs, tissue, and lesions, serving as a fundamental task in computer-aided diagnosis in medical imaging. Although deep learning models have achieved remarkable performance in medical image segmentation, the need for explainability remains critical for ensuring their acceptance and integration in clinical practice, despite the growing research attention in this area. Our approach explored the use of contrast-level Shapley values, a systematic perturbation of model inputs to assess feature importance. While other studies have investigated gradient-based techniques through identifying influential regions in imaging inputs, Shapley values offer a broader, clinically aligned approach, explaining how model performance is fairly attributed to certain imaging contrasts over others. Using the BraTS 2024 dataset, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · AI in cancer detection · Radiomics and Machine Learning in Medical Imaging