Training-Free Anomaly Generation via Dual-Attention Enhancement in Diffusion Model

Zuo Zuo; Jiahao Dong; Yanyun Qu; Zongze Wu

arXiv:2508.11550·cs.CV·August 18, 2025

Training-Free Anomaly Generation via Dual-Attention Enhancement in Diffusion Model

Zuo Zuo, Jiahao Dong, Yanyun Qu, Zongze Wu

PDF

TL;DR

This paper introduces AAG, a training-free framework leveraging Stable Diffusion with cross- and self-attention enhancements to generate realistic anomalies for industrial defect detection, improving data diversity and detection performance.

Contribution

AAG is a novel training-free anomaly generation method that enhances Stable Diffusion with attention mechanisms to produce high-fidelity anomalies without extra training data.

Findings

01

AAG effectively generates realistic anomalies in specific regions.

02

Generated anomalies improve downstream defect detection accuracy.

03

AAG outperforms existing methods in anomaly realism and utility.

Abstract

Industrial anomaly detection (AD) plays a significant role in manufacturing where a long-standing challenge is data scarcity. A growing body of works have emerged to address insufficient anomaly data via anomaly generation. However, these anomaly generation methods suffer from lack of fidelity or need to be trained with extra data. To this end, we propose a training-free anomaly generation framework dubbed AAG, which is based on Stable Diffusion (SD)'s strong generation ability for effective anomaly image generation. Given a normal image, mask and a simple text prompt, AAG can generate realistic and natural anomalies in the specific regions and simultaneously keep contents in other regions unchanged. In particular, we propose Cross-Attention Enhancement (CAE) to re-engineer the cross-attention mechanism within Stable Diffusion based on the given mask. CAE increases the similarity…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.