Spatial Autoregressive Modeling of DINOv3 Embeddings for Unsupervised Anomaly Detection
Ertunc Erdil, Nico Schulthess, Guney Tombak, Ender Konukoglu

TL;DR
This paper introduces a spatial autoregressive model for DINOv3 embeddings that explicitly captures spatial dependencies, improving unsupervised anomaly detection efficiency and performance without large memory overhead.
Contribution
It proposes a novel 2D autoregressive CNN framework that models spatial relationships in patch embeddings, reducing memory use and inference time compared to existing prototype-based methods.
Findings
Achieves competitive anomaly detection accuracy on BMAD datasets.
Significantly reduces inference time and memory requirements.
Demonstrates effectiveness of spatial dependency modeling in UAD.
Abstract
DINO models provide rich patch-level representations that have recently enabled strong performance in unsupervised anomaly detection (UAD). Most existing methods extract patch embeddings from ``normal'' images and model them independently, ignoring spatial and neighborhood relationships between patches. This implicitly assumes that self-attention and positional encodings sufficiently encode contextual information within each patch embedding. In addition, the normative distribution is often modeled as memory banks or prototype-based representations, which require storing large numbers of features and performing costly comparisons at inference time, leading to substantial memory and computational overhead. In this work, we address both limitations by proposing a simple and efficient framework that explicitly models spatial and contextual dependencies between patch embeddings using a 2D…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning
