CLIP-FSAC++: Few-Shot Anomaly Classification with Anomaly Descriptor Based on CLIP
Zuo Zuo, Jiahao Dong, Yao Wu, Yanyun Qu, Zongze Wu

TL;DR
CLIP-FSAC++ introduces a novel few-shot anomaly classification framework that enhances CLIP's representations through a cross-modality interaction module, improving anomaly detection accuracy in industrial scenarios with limited data.
Contribution
The paper proposes CLIP-FSAC++, a one-stage training framework with an anomaly descriptor that improves CLIP's adaptation for few-shot industrial anomaly classification.
Findings
Effective in 1, 2, 4, 8-shot settings on VisA and MVTEC-AD datasets.
Enhanced correlation between visual and text embeddings improves classification.
Outperforms existing few-shot anomaly detection methods.
Abstract
Industrial anomaly classification (AC) is an indispensable task in industrial manufacturing, which guarantees quality and safety of various product. To address the scarcity of data in industrial scenarios, lots of few-shot anomaly detection methods emerge recently. In this paper, we propose an effective few-shot anomaly classification (FSAC) framework with one-stage training, dubbed CLIP-FSAC++. Specifically, we introduce a cross-modality interaction module named Anomaly Descriptor following image and text encoders, which enhances the correlation of visual and text embeddings and adapts the representations of CLIP from pre-trained data to target data. In anomaly descriptor, image-to-text cross-attention module is used to obtain image-specific text embeddings and text-to-image cross-attention module is used to obtain text-specific visual embeddings. Then these modality-specific…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Network Security and Intrusion Detection · Computational Physics and Python Applications
MethodsSoftmax · Concatenated Skip Connection · Contrastive Language-Image Pre-training
