Do LLMs Understand Visual Anomalies? Uncovering LLM's Capabilities in Zero-shot Anomaly Detection
Jiaqi Zhu, Shaofeng Cai, Fang Deng, Beng Chin Ooi, Junran Wu

TL;DR
This paper introduces ALFA, a training-free method that enhances zero-shot visual anomaly detection using large vision-language models by adaptively generating prompts and aligning local image semantics for improved localization accuracy.
Contribution
ALFA employs a run-time prompt adaptation and a novel local semantic aligner to improve zero-shot anomaly detection and localization in vision-language models.
Findings
Achieves 12.1% PRO improvement on MVTec dataset.
Achieves 8.9% PRO improvement on VisA dataset.
Effective in leveraging language models for precise anomaly localization.
Abstract
Large vision-language models (LVLMs) are markedly proficient in deriving visual representations guided by natural language. Recent explorations have utilized LVLMs to tackle zero-shot visual anomaly detection (VAD) challenges by pairing images with textual descriptions indicative of normal and abnormal conditions, referred to as anomaly prompts. However, existing approaches depend on static anomaly prompts that are prone to cross-semantic ambiguity, and prioritize global image-level representations over crucial local pixel-level image-to-text alignment that is necessary for accurate anomaly localization. In this paper, we present ALFA, a training-free approach designed to address these challenges via a unified model. We propose a run-time prompt adaptation strategy, which first generates informative anomaly prompts to leverage the capabilities of a large language model (LLM). This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsViral Infections and Outbreaks Research · Anomaly Detection Techniques and Applications · COVID-19 diagnosis using AI
