Anomagic: Crossmodal Prompt-driven Zero-shot Anomaly Generation

Yuxin Jiang; Wei Luo; Hui Zhang; Qiyu Chen; Haiming Yao; Weiming Shen; Yunkang Cao

arXiv:2511.10020·cs.CV·November 14, 2025

Anomagic: Crossmodal Prompt-driven Zero-shot Anomaly Generation

Yuxin Jiang, Wei Luo, Hui Zhang, Qiyu Chen, Haiming Yao, Weiming Shen, Yunkang Cao

PDF

Open Access

TL;DR

Anomagic is a zero-shot anomaly generation method that uses crossmodal prompts to create realistic anomalies without exemplars, improving downstream detection accuracy and enabling versatile anomaly synthesis.

Contribution

It introduces a novel crossmodal prompt encoding scheme and a large dataset, AnomVerse, for training a versatile anomaly generation model.

Findings

01

Outperforms prior methods in realism and variety of generated anomalies

02

Enhances downstream anomaly detection accuracy

03

Can generate anomalies for any normal image using user prompts

Abstract

We propose Anomagic, a zero-shot anomaly generation method that produces semantically coherent anomalies without requiring any exemplar anomalies. By unifying both visual and textual cues through a crossmodal prompt encoding scheme, Anomagic leverages rich contextual information to steer an inpainting-based generation pipeline. A subsequent contrastive refinement strategy enforces precise alignment between synthesized anomalies and their masks, thereby bolstering downstream anomaly detection accuracy. To facilitate training, we introduce AnomVerse, a collection of 12,987 anomaly-mask-caption triplets assembled from 13 publicly available datasets, where captions are automatically generated by multimodal large language models using structured visual prompts and template-based textual hints. Extensive experiments demonstrate that Anomagic trained on AnomVerse can synthesize more realistic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning