TL;DR
MuSc-V2 introduces a zero-shot multimodal framework for industrial anomaly detection that leverages mutual scoring and neighborhood aggregation to improve accuracy without labeled data.
Contribution
The paper presents MuSc-V2, a novel zero-shot anomaly classification and segmentation method that integrates 2D/3D cues and mutual scoring for enhanced performance.
Findings
Achieves +23.7% AP on MVTec 3D-AD dataset.
Surpasses previous zero-shot benchmarks.
Outperforms many few-shot methods.
Abstract
Zero-shot anomaly classification (AC) and segmentation (AS) methods aim to identify and outline defects without using any labeled samples. In this paper, we reveal a key property that is overlooked by existing methods: normal image patches across industrial products typically find many other similar patches, not only in 2D appearance but also in 3D shapes, while anomalies remain diverse and isolated. To explicitly leverage this discriminative property, we propose a Mutual Scoring framework (MuSc-V2) for zero-shot AC/AS, which flexibly supports single 2D/3D or multimodality. Specifically, our method begins by improving 3D representation through Iterative Point Grouping (IPG), which reduces false positives from discontinuous surfaces. Then we use Similarity Neighborhood Aggregation with Multi-Degrees (SNAMD) to fuse 2D/3D neighborhood cues into more discriminative multi-scale patch…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
