TL;DR
This paper introduces a structural similarity-based perceptual loss for autoencoders, significantly improving unsupervised defect segmentation accuracy on complex real-world image datasets.
Contribution
It proposes replacing pixel-wise error with a structural similarity measure in autoencoders, enhancing defect detection in challenging scenarios.
Findings
Achieves better defect segmentation performance than traditional pixel-wise error methods.
Effective on real-world datasets of nanofibrous materials and woven fabrics.
Outperforms existing state-of-the-art approaches.
Abstract
Convolutional autoencoders have emerged as popular methods for unsupervised defect segmentation on image data. Most commonly, this task is performed by thresholding a pixel-wise reconstruction error based on an distance. This procedure, however, leads to large residuals whenever the reconstruction encompasses slight localization inaccuracies around edges. It also fails to reveal defective regions that have been visually altered when intensity values stay roughly consistent. We show that these problems prevent these approaches from being applied to complex real-world scenarios and that it cannot be easily avoided by employing more elaborate architectures such as variational or feature matching autoencoders. We propose to use a perceptual loss function based on structural similarity which examines inter-dependencies between local image regions, taking into account luminance,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
