Zero-Shot Image Anomaly Detection Using Generative Foundation Models

Lemar Abdi; Amaan Valiuddin; Francisco Caetano; Christiaan Viviers; Fons van der Sommen

arXiv:2507.22692·cs.CV·July 31, 2025

TL;DR

This paper introduces a novel zero-shot anomaly detection method using diffusion models as perceptual templates, achieving state-of-the-art results without dataset-specific re-training.

Contribution

The paper presents an approach that uses the denoising trajectories of diffusion models and Stein score errors for zero-shot anomaly detection, which the authors report outperforms existing methods.

Findings

01. Near-perfect performance on some benchmarks

02. Effective use of CelebA as a base distribution

03. Outperforms models trained on ImageNet in certain settings

Abstract

Detecting out-of-distribution (OOD) inputs is pivotal for deploying safe vision systems in open-world environments. We revisit diffusion models, not as generators, but as universal perceptual templates for OOD detection. This research explores the use of score-based generative models as foundational tools for semantic anomaly detection across unseen datasets. Specifically, we leverage the denoising trajectories of Denoising Diffusion Models (DDMs) as a rich source of texture and semantic information. By analyzing Stein score errors, amplified through the Structural Similarity Index Metric (SSIM), we introduce a novel method for identifying anomalous samples without requiring re-training on each target dataset. Our approach improves over state-of-the-art and relies on training a single model on one dataset -- CelebA -- which we find to be an effective base distribution, even…
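
The abstract describes scoring a sample by how well a denoising model handles it, with the error measured through SSIM. The sketch below illustrates only that general idea and is not the authors' implementation: `box_blur_denoiser` is a hypothetical stand-in for a trained diffusion model, `global_ssim` computes SSIM from whole-image statistics rather than the usual sliding window, and the noise level `sigma` and the `1 - SSIM` score are assumptions chosen for illustration.

```python
import numpy as np


def global_ssim(x, y, data_range=1.0):
    """SSIM computed from whole-image statistics (single window, simplified)."""
    c1 = (0.01 * data_range) ** 2
    c2 = (0.03 * data_range) ** 2
    mx, my = x.mean(), y.mean()
    vx, vy = x.var(), y.var()
    cov = ((x - mx) * (y - my)).mean()
    return ((2 * mx * my + c1) * (2 * cov + c2)) / (
        (mx ** 2 + my ** 2 + c1) * (vx + vy + c2)
    )


def box_blur_denoiser(x, k=3):
    """Hypothetical stand-in for a learned denoiser: a k x k mean filter."""
    pad = k // 2
    xp = np.pad(x, pad, mode="edge")
    out = np.zeros_like(x, dtype=float)
    for dy in range(k):
        for dx in range(k):
            out += xp[dy:dy + x.shape[0], dx:dx + x.shape[1]]
    return out / (k * k)


def anomaly_score(x, denoiser, sigma=0.1, seed=0):
    """Perturb x with Gaussian noise, denoise, and return 1 - SSIM(x, reconstruction).

    Higher values mean the denoiser reconstructs x poorly, which this
    sketch treats as evidence that x is out of distribution.
    """
    rng = np.random.default_rng(seed)
    noisy = x + sigma * rng.standard_normal(x.shape)
    recon = denoiser(noisy)
    return 1.0 - global_ssim(x, recon)


# Toy inputs: a smooth gradient (easy for the blur "denoiser") and a
# checkerboard (high-frequency structure the blur cannot reconstruct).
smooth = np.tile(np.linspace(0.0, 1.0, 32), (32, 1))
checker = (np.indices((32, 32)).sum(axis=0) % 2).astype(float)

score_smooth = anomaly_score(smooth, box_blur_denoiser)
score_checker = anomaly_score(checker, box_blur_denoiser)
print(score_smooth, score_checker)
```

In this toy setup the checkerboard receives the higher score because the stand-in denoiser only models smooth content. With a real diffusion model, the analogous role would be played by whatever the model learned from its training distribution (CelebA in the paper).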
