AnoStyler: Text-Driven Localized Anomaly Generation via Lightweight Style Transfer
Yulim So, Seokho Kang

TL;DR
AnoStyler introduces a lightweight, text-guided style transfer method for realistic, localized anomaly generation from normal images, improving data diversity and detection performance.
Contribution
It presents a novel zero-shot anomaly generation approach using lightweight models and CLIP-based loss, overcoming realism, data, and model size limitations of prior methods.
Findings
Outperforms existing methods in image quality and diversity
Enhances anomaly detection accuracy with generated data
Operates efficiently with minimal data and model size
Abstract
Anomaly generation has been widely explored to address the scarcity of anomaly images in real-world data. However, existing methods typically suffer from at least one of the following limitations, hindering their practical deployment: (1) lack of visual realism in generated anomalies; (2) dependence on large amounts of real images; and (3) use of memory-intensive, heavyweight model architectures. To overcome these limitations, we propose AnoStyler, a lightweight yet effective method that frames zero-shot anomaly generation as text-guided style transfer. Given a single normal image along with its category label and expected defect type, an anomaly mask indicating the localized anomaly regions and two-class text prompts representing the normal and anomaly states are generated using generalizable category-agnostic procedures. A lightweight U-Net model trained with CLIP-based loss functions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning
