Unsupervised Multi-View Visual Anomaly Detection via Progressive Homography-Guided Alignment

Xintao Chen; Xiaohao Xu; Bozhong Zheng; Yun Liu; Yingna Wu

arXiv:2511.18766·cs.CV·November 25, 2025

Unsupervised Multi-View Visual Anomaly Detection via Progressive Homography-Guided Alignment

Xintao Chen, Xiaohao Xu, Bozhong Zheng, Yun Liu, Yingna Wu

PDF

Open Access 1 Video

TL;DR

This paper introduces ViewSense-AD, a framework for unsupervised multi-view visual anomaly detection that aligns features across views using homography, improving robustness to viewpoint changes and achieving state-of-the-art results.

Contribution

It proposes a novel multi-view alignment module with homography, integrated into a diffusion model, to learn viewpoint-invariant representations for anomaly detection.

Findings

01

Outperforms existing methods on RealIAD and MANTA datasets

02

Achieves state-of-the-art accuracy in pixel, view, and sample-level detection

03

Demonstrates robustness to large viewpoint shifts and complex textures

Abstract

Unsupervised visual anomaly detection from multi-view images presents a significant challenge: distinguishing genuine defects from benign appearance variations caused by viewpoint changes. Existing methods, often designed for single-view inputs, treat multiple views as a disconnected set of images, leading to inconsistent feature representations and a high false-positive rate. To address this, we introduce ViewSense-AD (VSAD), a novel framework that learns viewpoint-invariant representations by explicitly modeling geometric consistency across views. At its core is our Multi-View Alignment Module (MVAM), which leverages homography to project and align corresponding feature regions between neighboring views. We integrate MVAM into a View-Align Latent Diffusion Model (VALDM), enabling progressive and multi-stage alignment during the denoising process. This allows the model to build a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Unsupervised Multi-View Visual Anomaly Detection via Progressive Homography-Guided Alignment· underline

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning · Digital Media Forensic Detection