Semantic Adversarial Attacks via Diffusion Models
Chenan Wang, Jinhao Duan, Chaowei Xiao, Edward Kim, Matthew Stamm,, Kaidi Xu

TL;DR
This paper introduces a novel framework for semantic adversarial attacks using diffusion models, enabling high success rates and realistic perturbations by manipulating semantic features in the latent space.
Contribution
The paper proposes two diffusion-based methods for semantic adversarial attacks, leveraging latent space manipulation for high success rates and broad applicability in white-box and black-box settings.
Findings
Achieves nearly 100% attack success rate across multiple scenarios.
Demonstrates high fidelity and transferability of generated adversarial examples.
Outperforms baseline methods in attack success and image quality metrics.
Abstract
Traditional adversarial attacks concentrate on manipulating clean examples in the pixel space by adding adversarial perturbations. By contrast, semantic adversarial attacks focus on changing semantic attributes of clean examples, such as color, context, and features, which are more feasible in the real world. In this paper, we propose a framework to quickly generate a semantic adversarial attack by leveraging recent diffusion models since semantic information is included in the latent space of well-trained diffusion models. Then there are two variants of this framework: 1) the Semantic Transformation (ST) approach fine-tunes the latent space of the generated image and/or the diffusion model itself; 2) the Latent Masking (LM) approach masks the latent space with another target image and local backpropagation-based interpretation methods. Additionally, the ST approach can be applied in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · COVID-19 diagnosis using AI · Anomaly Detection Techniques and Applications
MethodsFocus · Diffusion
