Normality Prior Guided Multi-Semantic Fusion Network for Unsupervised Image Anomaly Detection
Muhao Xu, Xueying Zhou, Xizhan Gao, Weiye Song, Guang Feng, and Sijie Niu

TL;DR
This paper introduces a normality prior guided multi-semantic fusion network that enhances unsupervised image anomaly detection by incorporating global semantic features of normal samples, achieving state-of-the-art results.
Contribution
It proposes a novel multi-semantic fusion approach guided by normality priors, utilizing a vision-language network and semantic codebooks to improve anomaly detection accuracy.
Findings
Achieves state-of-the-art performance on MVTec LOCO AD dataset
Improves pixel-sPRO by 5.7%
Enhances image-AUROC by 2.6%
Abstract
Recently, detecting logical anomalies is becoming a more challenging task compared to detecting structural ones. Existing encoder decoder based methods typically compress inputs into low-dimensional bottlenecks on the assumption that the compression process can effectively suppress the transmission of logical anomalies to the decoder. However, logical anomalies present a particular difficulty because, while their local features often resemble normal semantics, their global semantics deviate significantly from normal patterns. Thanks to the generalisation capabilities inherent in neural networks, these abnormal semantic features can propagate through low-dimensional bottlenecks. This ultimately allows the decoder to reconstruct anomalous images with misleading fidelity. To tackle the above challenge, we propose a novel normality prior guided multi-semantic fusion network for unsupervised…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
