AnomalyControl: Learning Cross-modal Semantic Features for Controllable Anomaly Synthesis
Shidan He, Lei Liu, Xiujun Shu, Bo Wang, Yuanhao Feng, Shen Zhao

TL;DR
AnomalyControl introduces a cross-modal semantic feature learning framework to generate more realistic and controllable anomaly images by leveraging text-image reference prompts and specialized attention mechanisms.
Contribution
It proposes a novel cross-modal semantic modeling approach with a semantic-guided adapter for improved anomaly synthesis, surpassing existing methods in realism and control.
Findings
Achieves state-of-the-art anomaly synthesis results.
Enhances realism and diversity of generated anomalies.
Improves downstream anomaly inspection tasks.
Abstract
Anomaly synthesis is a crucial approach to augment abnormal data for advancing anomaly inspection. Based on the knowledge from the large-scale pre-training, existing text-to-image anomaly synthesis methods predominantly focus on textual information or coarse-aligned visual features to guide the entire generation process. However, these methods often lack sufficient descriptors to capture the complicated characteristics of realistic anomalies (e.g., the fine-grained visual pattern of anomalies), limiting the realism and generalization of the generation process. To this end, we propose a novel anomaly synthesis framework called AnomalyControl to learn cross-modal semantic features as guidance signals, which could encode the generalized anomaly cues from text-image reference prompts and improve the realism of synthesized abnormal samples. Specifically, AnomalyControl adopts a flexible and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Software Engineering Research · Data Visualization and Analytics
MethodsSoftmax · Attention Is All You Need · Focus · Adapter
