FiLo: Zero-Shot Anomaly Detection by Fine-Grained Description and High-Quality Localization
Zhaopeng Gu, Bingke Zhu, Guibo Zhu, Yingying Chen, Hao Li, Ming Tang, Jinqiao Wang

TL;DR
FiLo introduces a zero-shot anomaly detection method that leverages fine-grained descriptions and high-quality localization techniques, significantly improving detection accuracy and localization precision across diverse object categories.
Contribution
The paper proposes FiLo, a novel ZSAD approach combining fine-grained anomaly descriptions with position-enhanced localization, addressing limitations of generic descriptions and single-patch similarity methods.
Findings
Achieves state-of-the-art AUC of 83.9% on VisA dataset.
Demonstrates improved localization accuracy with a pixel-level AUC of 95.9%.
Outperforms existing zero-shot anomaly detection methods.
Abstract
Zero-shot anomaly detection (ZSAD) methods entail detecting anomalies directly without access to any known normal or abnormal samples within the target item categories. Existing approaches typically rely on the robust generalization capabilities of multimodal pretrained models, computing similarities between manually crafted textual features representing "normal" or "abnormal" semantics and image features to detect anomalies and localize anomalous patches. However, the generic descriptions of "abnormal" often fail to precisely match diverse types of anomalies across different object categories. Additionally, computing feature similarities for single patches struggles to pinpoint specific locations of anomalies with various sizes and scales. To address these issues, we propose a novel ZSAD method called FiLo, comprising two components: adaptively learned Fine-Grained Description (FG-Des)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiation Detection and Scintillator Technologies · Medical Imaging Techniques and Applications · Nuclear Physics and Applications
MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Multi-Head Attention · Dense Connections · Residual Connection · Softmax · Vision Transformer · self-DIstillation with NO labels
