Investigating the Semantic Robustness of CLIP-based Zero-Shot Anomaly   Segmentation

Kevin Stangl; Marius Arvinte; Weilin Xu; Cory Cornelius

arXiv:2405.07969·cs.CV·May 14, 2024·1 cites

Investigating the Semantic Robustness of CLIP-based Zero-Shot Anomaly Segmentation

Kevin Stangl, Marius Arvinte, Weilin Xu, Cory Cornelius

PDF

Open Access

TL;DR

This paper evaluates the robustness of CLIP-based zero-shot anomaly segmentation under semantic perturbations, revealing significant performance drops across different models and highlighting the need for careful robustness assessment.

Contribution

It systematically investigates the semantic robustness of WinCLIP zero-shot anomaly segmentation under various perturbations, providing empirical performance bounds and insights.

Findings

01

Performance drops up to 20% in ROC AUC

02

Performance drops up to 40% in per-region overlap

03

Robustness issues are consistent across different CLIP backbones

Abstract

Zero-shot anomaly segmentation using pre-trained foundation models is a promising approach that enables effective algorithms without expensive, domain-specific training or fine-tuning. Ensuring that these methods work across various environmental conditions and are robust to distribution shifts is an open problem. We investigate the performance of WinCLIP [14] zero-shot anomaly segmentation algorithm by perturbing test data using three semantic transformations: bounded angular rotations, bounded saturation shifts, and hue shifts. We empirically measure a lower performance bound by aggregating across per-sample worst-case perturbations and find that average performance drops by up to 20% in area under the ROC curve and 40% in area under the per-region overlap curve. We find that performance is consistently lowered on three CLIP backbones, regardless of model architecture or learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Data-Driven Disease Surveillance · COVID-19 diagnosis using AI

MethodsContrastive Language-Image Pre-training