StackCLIP: Clustering-Driven Stacked Prompt in Zero-Shot Industrial Anomaly Detection

Yanning Hou; Yanran Ruan; Junfa Li; Shanshan Wang; Jianfeng Qiu; Ke Xu

arXiv:2506.23577·cs.CV·July 8, 2025

StackCLIP: Clustering-Driven Stacked Prompt in Zero-Shot Industrial Anomaly Detection

Yanning Hou, Yanran Ruan, Junfa Li, Shanshan Wang, Jianfeng Qiu, Ke Xu

PDF

Open Access

TL;DR

StackCLIP introduces a novel clustering-driven stacked prompt approach that enhances zero-shot industrial anomaly detection by improving text-image alignment, generalization, and anomaly segmentation performance.

Contribution

The paper proposes a new stacked prompt framework with clustering and ensemble modules, significantly improving zero-shot anomaly detection and classification in industrial settings.

Findings

01

Achieves state-of-the-art results on seven datasets.

02

Improves training speed, stability, and convergence.

03

Enhances generalization across classification tasks.

Abstract

Enhancing the alignment between text and image features in the CLIP model is a critical challenge in zero-shot industrial anomaly detection tasks. Recent studies predominantly utilize specific category prompts during pretraining, which can cause overfitting to the training categories and limit model generalization. To address this, we propose a method that transforms category names through multicategory name stacking to create stacked prompts, forming the basis of our StackCLIP model. Our approach introduces two key components. The Clustering-Driven Stacked Prompts (CSP) module constructs generic prompts by stacking semantically analogous categories, while utilizing multi-object textual feature fusion to amplify discriminative anomalies among similar objects. The Ensemble Feature Alignment (EFA) module trains knowledge-specific linear layers tailored for each stack cluster and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Fault Detection and Control Systems · Network Security and Intrusion Detection