LADMIM: Logical Anomaly Detection with Masked Image Modeling in Discrete Latent Space
Shunsuke Sakai, Tatushito Hasegawa, Makoto Koshino

TL;DR
LADMIM introduces a novel unsupervised logical anomaly detection framework that leverages masked image modeling and discrete latent representations to effectively identify complex logical anomalies in images.
Contribution
The paper presents a new approach combining masked image modeling with discrete latent space learning for logical anomaly detection, improving accuracy without pre-trained segmentation models.
Findings
Achieves competitive performance on five benchmarks.
Focuses on logical dependencies rather than low-level pixel variance.
Does not require pre-trained segmentation models.
Abstract
Detecting anomalies such as an incorrect combination of objects or deviations in their positions is a challenging problem in unsupervised anomaly detection (AD). Since conventional AD methods mainly focus on local patterns of normal images, they struggle with detecting logical anomalies that appear in the global patterns. To effectively detect these challenging logical anomalies, we introduce Logical Anomaly Detection with Masked Image Modeling (LADMIM), a novel unsupervised AD framework that harnesses the power of masked image modeling and discrete representation learning. Our core insight is that predicting the missing region forces the model to learn the long-range dependencies between patches. Specifically, we formulate AD as a mask completion task, which predicts the distribution of discrete latents in the masked region. As a distribution of discrete latents is invariant to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Network Security and Intrusion Detection · Advanced Malware Detection Techniques
MethodsMutual Information Machine/Mask Image Modeling · Focus
