Boosting Imperceptibility of Stable Diffusion-based Adversarial Examples   Generation with Momentum

Nashrah Haque; Xiang Li; Zhehui Chen; Yanzhao Wu; Lei Yu; Arun Iyengar; and Wenqi Wei

arXiv:2410.13122·cs.CV·October 18, 2024

Boosting Imperceptibility of Stable Diffusion-based Adversarial Examples Generation with Momentum

Nashrah Haque, Xiang Li, Zhehui Chen, Yanzhao Wu, Lei Yu, Arun Iyengar, and Wenqi Wei

PDF

Open Access 1 Repo

TL;DR

This paper introduces SD-MIAE, a novel framework that uses stable diffusion and momentum optimization to generate highly effective, visually imperceptible adversarial examples that can fool classifiers while maintaining semantic similarity.

Contribution

The paper presents a new method combining stable diffusion and momentum to improve the quality and effectiveness of adversarial examples in neural network testing.

Findings

01

Achieves a 79% misclassification rate, 35% higher than previous methods.

02

Maintains high visual fidelity and semantic similarity in adversarial images.

03

Enhances the stability and robustness of adversarial perturbations.

Abstract

We propose a novel framework, Stable Diffusion-based Momentum Integrated Adversarial Examples (SD-MIAE), for generating adversarial examples that can effectively mislead neural network classifiers while maintaining visual imperceptibility and preserving the semantic similarity to the original class label. Our method leverages the text-to-image generation capabilities of the Stable Diffusion model by manipulating token embeddings corresponding to the specified class in its latent space. These token embeddings guide the generation of adversarial images that maintain high visual fidelity. The SD-MIAE framework consists of two phases: (1) an initial adversarial optimization phase that modifies token embeddings to produce misclassified yet natural-looking images and (2) a momentum-based optimization phase that refines the adversarial perturbations. By introducing momentum, our approach…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nashrahhaque/sd-miae
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Adversarial Robustness in Machine Learning

MethodsDiffusion