Semantic Adversarial Attacks via Diffusion Models

Chenan Wang; Jinhao Duan; Chaowei Xiao; Edward Kim; Matthew Stamm,; Kaidi Xu

arXiv:2309.07398·cs.CV·September 15, 2023·1 cites

Semantic Adversarial Attacks via Diffusion Models

Chenan Wang, Jinhao Duan, Chaowei Xiao, Edward Kim, Matthew Stamm,, Kaidi Xu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel framework for semantic adversarial attacks using diffusion models, enabling high success rates and realistic perturbations by manipulating semantic features in the latent space.

Contribution

The paper proposes two diffusion-based methods for semantic adversarial attacks, leveraging latent space manipulation for high success rates and broad applicability in white-box and black-box settings.

Findings

01

Achieves nearly 100% attack success rate across multiple scenarios.

02

Demonstrates high fidelity and transferability of generated adversarial examples.

03

Outperforms baseline methods in attack success and image quality metrics.

Abstract

Traditional adversarial attacks concentrate on manipulating clean examples in the pixel space by adding adversarial perturbations. By contrast, semantic adversarial attacks focus on changing semantic attributes of clean examples, such as color, context, and features, which are more feasible in the real world. In this paper, we propose a framework to quickly generate a semantic adversarial attack by leveraging recent diffusion models since semantic information is included in the latent space of well-trained diffusion models. Then there are two variants of this framework: 1) the Semantic Transformation (ST) approach fine-tunes the latent space of the generated image and/or the diffusion model itself; 2) the Latent Masking (LM) approach masks the latent space with another target image and local backpropagation-based interpretation methods. Additionally, the ST approach can be applied in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

steven202/semantic_adv_via_dm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · COVID-19 diagnosis using AI · Anomaly Detection Techniques and Applications

MethodsFocus · Diffusion