Detecting Performance Degradation under Data Shift in Pathology Vision-Language Model
Hao Guan, Li Zhou

TL;DR
This paper investigates methods to detect performance degradation in pathology vision-language models caused by data shift, proposing a combined approach of input data shift detection and output confidence monitoring for improved reliability.
Contribution
The study introduces DomainSAT, a toolbox for analyzing data shift, and demonstrates that combining input shift detection with output confidence indicators enhances performance degradation detection in pathology VLMs.
Findings
Input data shift detection identifies distributional changes effectively.
Output confidence indicators closely relate to actual performance degradation.
Combining both methods improves reliability in monitoring model performance.
Abstract
Vision-Language Models have demonstrated strong potential in medical image analysis and disease diagnosis. However, after deployment, their performance may deteriorate when the input data distribution shifts from that observed during development. Detecting such performance degradation is essential for clinical reliability, yet remains challenging for large pre-trained VLMs operating without labeled data. In this study, we investigate performance degradation detection under data shift in a state-of-the-art pathology VLM. We examine both input-level data shift and output-level prediction behavior to understand their respective roles in monitoring model reliability. To facilitate systematic analysis of input data shift, we develop DomainSAT, a lightweight toolbox with a graphical interface that integrates representative shift detection algorithms and enables intuitive exploration of data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · AI in cancer detection
