Anomagic: Crossmodal Prompt-driven Zero-shot Anomaly Generation
Yuxin Jiang, Wei Luo, Hui Zhang, Qiyu Chen, Haiming Yao, Weiming Shen, Yunkang Cao

TL;DR
Anomagic is a zero-shot anomaly generation method that uses crossmodal prompts to create realistic anomalies without exemplars, improving downstream detection accuracy and enabling versatile anomaly synthesis.
Contribution
It introduces a novel crossmodal prompt encoding scheme and a large dataset, AnomVerse, for training a versatile anomaly generation model.
Findings
Outperforms prior methods in realism and variety of generated anomalies
Enhances downstream anomaly detection accuracy
Can generate anomalies for any normal image using user prompts
Abstract
We propose Anomagic, a zero-shot anomaly generation method that produces semantically coherent anomalies without requiring any exemplar anomalies. By unifying both visual and textual cues through a crossmodal prompt encoding scheme, Anomagic leverages rich contextual information to steer an inpainting-based generation pipeline. A subsequent contrastive refinement strategy enforces precise alignment between synthesized anomalies and their masks, thereby bolstering downstream anomaly detection accuracy. To facilitate training, we introduce AnomVerse, a collection of 12,987 anomaly-mask-caption triplets assembled from 13 publicly available datasets, where captions are automatically generated by multimodal large language models using structured visual prompts and template-based textual hints. Extensive experiments demonstrate that Anomagic trained on AnomVerse can synthesize more realistic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
