AnomalyVFM -- Transforming Vision Foundation Models into Zero-Shot Anomaly Detectors

Matic Fu\v{c}ka; Vitjan Zavrtanik; Danijel Sko\v{c}aj

arXiv:2601.20524·cs.CV·April 10, 2026

AnomalyVFM -- Transforming Vision Foundation Models into Zero-Shot Anomaly Detectors

Matic Fu\v{c}ka, Vitjan Zavrtanik, Danijel Sko\v{c}aj

PDF

2 Repos 4 Models 1 Datasets

TL;DR

AnomalyVFM introduces a framework that transforms pretrained vision foundation models into effective zero-shot anomaly detectors by combining synthetic data generation and efficient adaptation techniques.

Contribution

It presents a novel approach that significantly improves zero-shot anomaly detection performance using vision foundation models, addressing dataset diversity and adaptation limitations.

Findings

01

Achieves 94.1% AUROC on average across 9 datasets

02

Outperforms previous methods by 3.3 percentage points

03

Utilizes a three-stage synthetic dataset and low-rank feature adapters

Abstract

Zero-shot anomaly detection aims to detect and localise abnormal regions in the image without access to any in-domain training images. While recent approaches leverage vision-language models (VLMs), such as CLIP, to transfer high-level concept knowledge, methods based on purely vision foundation models (VFMs), like DINOv2, have lagged behind in performance. We argue that this gap stems from two practical issues: (i) limited diversity in existing auxiliary anomaly detection datasets and (ii) overly shallow VFM adaptation strategies. To address both challenges, we propose AnomalyVFM, a general and effective framework that turns any pretrained VFM into a strong zero-shot anomaly detector. Our approach combines a robust three-stage synthetic dataset generation scheme with a parameter-efficient adaptation mechanism, utilising low-rank feature adapters and a confidence-weighted pixel loss.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Datasets

MaticFuc/AnomalyVFM_Synthetic_Dataset
dataset· 69 dl
69 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.