Disentangling Prompt Dependence to Evaluate Segmentation Reliability in Gynecological MRI

Elodie Germani (UR; LTSI); Krystel Nyangoh-Timoh; Pierre Jannin (LTSI); John S H Baxter

arXiv:2603.13369·cs.CV·March 17, 2026

Disentangling Prompt Dependence to Evaluate Segmentation Reliability in Gynecological MRI

Elodie Germani (UR, LTSI), Krystel Nyangoh-Timoh, Pierre Jannin (LTSI), John S H Baxter

PDF

Open Access

TL;DR

This paper introduces a framework to evaluate the robustness of promptable segmentation models in gynecological MRI by disentangling prompt ambiguity from local sensitivity, revealing their impact on segmentation reliability.

Contribution

It presents the first explicit formulation of prompt dependence that separates prompt ambiguity from local sensitivity, enabling interpretable assessment of segmentation robustness.

Findings

01

Strong negative correlation between metrics and segmentation performance

02

Low mutual correlation between the two proposed metrics

03

Metrics effectively identify prompt-related failure modes

Abstract

Promptable segmentation models (e.g., the Segment Anything Models) enable generalizable, zero-shot segmentation across diverse domains. Although predictions are deterministic for a fixed image-prompt pair, the robustness of these models to variations in user prompts, referred to as prompt dependence, remains underexplored. In safety-critical workflows with substantial inter-user variability, interpretable and informative frameworks are needed to evaluate prompt dependence. In this work, we assess the reliability of promptable segmentation by analyzing and measuring its sensitivity to prompt variability. We introduce the first formulation of prompt dependence that explicitly disentangles prompt ambiguity (inter-user variability) from local sensitivity (interaction imprecision), offering an interpretable view of segmentation robustness. Experiments on two female pelvic MRI datasets for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFetal and Pediatric Neurological Disorders · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning