SeaS: Few-shot Industrial Anomaly Image Generation with Separation and Sharing Fine-tuning
Zhewei Dai, Shilei Zeng, Haotian Liu, Xurui Li, Feng Xue, Yu Zhou

TL;DR
SeaS is a unified generative model that creates diverse anomalies, normal products, and precise masks for industrial applications, improving over prior methods by using a novel separation and sharing fine-tuning approach.
Contribution
The paper introduces SeaS, a unified model leveraging U-Net's differentiation ability and novel loss functions to generate diverse anomalies and normal products with high accuracy.
Findings
Sets new benchmarks in anomaly generation quality.
Improves downstream anomaly detection and segmentation performance.
Enables creation of unseen anomalies through attribute recombination.
Abstract
We introduce SeaS, a unified industrial generative model for automatically creating diverse anomalies, authentic normal products, and precise anomaly masks. While extensive research exists, most efforts either focus on specific tasks, i.e., anomalies or normal products only, or require separate models for each anomaly type. Consequently, prior methods either offer limited generative capability or depend on a vast array of anomaly-specific models. We demonstrate that U-Net's differentiated learning ability captures the distinct visual traits of slightly-varied normal products and diverse anomalies, enabling us to construct a unified model for all tasks. Specifically, we first introduce an Unbalanced Abnormal (UA) Text Prompt, comprising one normal token and multiple anomaly tokens. More importantly, our Decoupled Anomaly Alignment (DA) loss decouples anomaly attributes and binds them to…
Peer Reviews
Decision·Submitted to ICLR 2025
1. Leveraging text prompts to guide the model in decoupling the generation of abnormal regions and objects. 2. Using VAE to generate high-resolution annotations is a good direction.
1. The relationship between anomaly tokens and training different types of anomalies is not clear. 2. The paper has not discussed how to control the type of exceptions generated during inference. 3. The paper does not explain why the U-net used to predict noise in the Refined Mask Prediction branch has a highly discriminative feature.
+ The motivation is good. A shared generation model for multiple anomaly types is proposed to solve the problem of insufficient anomaly images. + The generated anomaly images seem more real than other GAN-based methods. + Some ablation studies are provided to facilitate the understanding of how the performance benefits from different components, including the DA loss, the NA loss, and the refined mask prediction branch.
- The experimental results are insufficient. Because the ultimate goal of generating abnormal images is to improve the performance of anomaly detection tasks, some SOTA anomaly detection methods should also be compared on image AUROC, pixel AUROC and PRO besides generative model-based anomaly detection methods, e.g., DiAD [1]. Although RealNet [2] is compared in appendix A.5, the proposed method does not significantly outperform RealNet, particularly AUROC, and RealNet does not use any anomaly s
In general, the proposed method is reasonable and the results are fine. + Both anomaly image generation and anomaly segmentation are considered. + This paper is clearly presented. + The proposed method achieves realistic and diverse generation of abnormal samples.
- The word “Unbalanced” in Unbalanced Abnormal Text Prompt is inappropriate. The author believes fixed generic semantic words may fail to align with a few training images that contain specific defect types. Therefore, the text prompt should be expressed as dynamic or learnable, etc. - The author should add the results of Baseline in Table 4, such as generation of typical text prompt with fixed generic semantic words. Moreover, the result with different layers in UNet in RMP branch should be dis
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Image Processing Techniques and Applications
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Concatenated Skip Connection · Convolution · Max Pooling · U-Net
