TAIGen: Training-Free Adversarial Image Generation via Diffusion Models
Susim Roy, Anubhooti Jain, Mayank Vatsa, Richa Singh

TL;DR
TAIGen is a training-free, efficient black-box adversarial image generation method using diffusion models, achieving high success rates with minimal sampling steps and preserving image quality.
Contribution
Introduces TAIGen, a novel approach that generates adversarial images with only 3-20 diffusion steps without training, improving efficiency and attack success.
Findings
Achieves over 70% attack success on ImageNet models.
Maintains PSNR above 30 dB across datasets.
Generates adversarial examples 10x faster than existing methods.
Abstract
Adversarial attacks from generative models often produce low-quality images and require substantial computational resources. Diffusion models, though capable of high-quality generation, typically need hundreds of sampling steps for adversarial generation. This paper introduces TAIGen, a training-free black-box method for efficient adversarial image generation. TAIGen produces adversarial examples using only 3-20 sampling steps from unconditional diffusion models. Our key finding is that perturbations injected during the mixing step interval achieve comparable attack effectiveness without processing all timesteps. We develop a selective RGB channel strategy that applies attention maps to the red channel while using GradCAM-guided perturbations on green and blue channels. This design preserves image structure while maximizing misclassification in target models. TAIGen maintains visual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection
