Normality Prior Guided Multi-Semantic Fusion Network for Unsupervised Image Anomaly Detection

Muhao Xu; Xueying Zhou; Xizhan Gao; Weiye Song; Guang Feng; and Sijie Niu

arXiv:2506.18544·cs.CV·June 24, 2025

Normality Prior Guided Multi-Semantic Fusion Network for Unsupervised Image Anomaly Detection

Muhao Xu, Xueying Zhou, Xizhan Gao, Weiye Song, Guang Feng, and Sijie Niu

PDF

TL;DR

This paper introduces a normality prior guided multi-semantic fusion network that enhances unsupervised image anomaly detection by incorporating global semantic features of normal samples, achieving state-of-the-art results.

Contribution

It proposes a novel multi-semantic fusion approach guided by normality priors, utilizing a vision-language network and semantic codebooks to improve anomaly detection accuracy.

Findings

01

Achieves state-of-the-art performance on MVTec LOCO AD dataset

02

Improves pixel-sPRO by 5.7%

03

Enhances image-AUROC by 2.6%

Abstract

Recently, detecting logical anomalies is becoming a more challenging task compared to detecting structural ones. Existing encoder decoder based methods typically compress inputs into low-dimensional bottlenecks on the assumption that the compression process can effectively suppress the transmission of logical anomalies to the decoder. However, logical anomalies present a particular difficulty because, while their local features often resemble normal semantics, their global semantics deviate significantly from normal patterns. Thanks to the generalisation capabilities inherent in neural networks, these abnormal semantic features can propagate through low-dimensional bottlenecks. This ultimately allows the decoder to reconstruct anomalous images with misleading fidelity. To tackle the above challenge, we propose a novel normality prior guided multi-semantic fusion network for unsupervised…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.