Patch-aware Vector Quantized Codebook Learning for Unsupervised Visual   Defect Detection

Qisen Cheng; Shuhui Qu; Janghwan Lee

arXiv:2501.09187·cs.CV·January 17, 2025

Patch-aware Vector Quantized Codebook Learning for Unsupervised Visual Defect Detection

Qisen Cheng, Shuhui Qu, Janghwan Lee

PDF

Open Access

TL;DR

This paper introduces a patch-aware vector quantized codebook learning method within an enhanced VQ-VAE framework, significantly improving unsupervised visual defect detection accuracy by optimizing spatial representations and code assignment.

Contribution

The paper presents a novel patch-aware dynamic code assignment scheme for VQ-VAE, enhancing defect detection by better capturing local context and spatial features.

Findings

01

Achieves state-of-the-art results on MVTecAD, BTAD, and MTSD datasets.

02

Improves defect detection accuracy by optimizing spatial code allocation.

03

Enhances normal-defect distinction through context-sensitive code assignment.

Abstract

Unsupervised visual defect detection is critical in industrial applications, requiring a representation space that captures normal data features while detecting deviations. Achieving a balance between expressiveness and compactness is challenging; an overly expressive space risks inefficiency and mode collapse, impairing detection accuracy. We propose a novel approach using an enhanced VQ-VAE framework optimized for unsupervised defect detection. Our model introduces a patch-aware dynamic code assignment scheme, enabling context-sensitive code allocation to optimize spatial representation. This strategy enhances normal-defect distinction and improves detection accuracy during inference. Experiments on MVTecAD, BTAD, and MTSD datasets show our method achieves state-of-the-art performance.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIndustrial Vision Systems and Defect Detection · Image Processing Techniques and Applications · Retinal Imaging and Analysis

MethodsVQ-VAE