Attention Masks Help Adversarial Attacks to Bypass Safety Detectors

Yunfan Shi

arXiv:2411.04772 · cs.CR · November 8, 2024

TL;DR

This paper introduces an adaptive attention mask framework that makes PGD adversarial attacks stealthier and more efficient against classifiers protected by explainability-based detectors, outperforming benchmark PGD, Sparsefool, and SINIFGSM.

Contribution

The paper proposes an attention mask generation method that combines a mutation XAI mixture with a multitask self-supervised X-UNet to improve the stealth and explainability of adversarial attacks.

Findings

01. Outperforms benchmark PGD, Sparsefool, and SINIFGSM in stealth and efficiency

02. Effective in fooling SOTA explainability-based defense classifiers

03. Demonstrates success on MNIST and CIFAR-10 datasets

Abstract

Despite recent advances in adversarial attack methods, current attacks against XAI monitors remain detectable and slow. In this paper, we present an adaptive framework for attention mask generation that enables stealthy, explainable, and efficient PGD adversarial attacks on image classifiers under XAI monitors. Specifically, we use a mutation XAI mixture and a multitask self-supervised X-UNet to generate attention masks that guide the PGD attack. Experiments on MNIST (MLP) and CIFAR-10 (AlexNet) show that our system outperforms benchmark PGD, Sparsefool, and the SOTA SINIFGSM in balancing stealth, efficiency, and explainability, which is crucial for effectively fooling classifiers protected by SOTA defenses.
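The abstract's central idea, a mask that restricts where PGD is allowed to perturb, can be illustrated with a minimal NumPy sketch. This is not the paper's implementation: the linear softmax classifier, the hand-picked `mask`, and the names `masked_pgd`, `cross_entropy`, and `loss_grad_wrt_input` are all assumptions made for illustration. In the paper the mask would come from the mutation XAI mixture and X-UNet, which are not reproduced here.

```python
import numpy as np


def softmax(z):
    """Numerically stable softmax over a 1-D logit vector."""
    e = np.exp(z - z.max())
    return e / e.sum()


def cross_entropy(W, b, x, y):
    """Cross-entropy loss of a toy linear softmax classifier (an assumption)."""
    return -np.log(softmax(W @ x + b)[y])


def loss_grad_wrt_input(W, b, x, y):
    """Gradient of the cross-entropy loss with respect to the input x."""
    p = softmax(W @ x + b)
    p[y] -= 1.0  # dL/dlogits = softmax - onehot(y)
    return W.T @ p


def masked_pgd(W, b, x0, y, mask, eps=0.1, alpha=0.02, steps=10):
    """L-infinity PGD where `mask` (values in [0, 1]) scales each pixel's step.

    Pixels with mask == 0 are never modified; the mask here is a
    hypothetical stand-in for a learned attention mask.
    """
    x = x0.copy()
    for _ in range(steps):
        g = loss_grad_wrt_input(W, b, x, y)
        x = x + alpha * mask * np.sign(g)    # masked signed-gradient ascent
        x = np.clip(x, x0 - eps, x0 + eps)   # project onto the eps-ball around x0
        x = np.clip(x, 0.0, 1.0)             # keep a valid pixel range
    return x


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    W, b = rng.normal(size=(3, 8)), np.zeros(3)
    x0 = rng.uniform(0.2, 0.8, size=8)
    y = int(np.argmax(W @ x0 + b))
    mask = np.array([1, 1, 1, 1, 0, 0, 0, 0], dtype=float)
    x_adv = masked_pgd(W, b, x0, y, mask)
    print("loss before:", cross_entropy(W, b, x0, y))
    print("loss after: ", cross_entropy(W, b, x_adv, y))
    print("changed pixels:", np.flatnonzero(x_adv != x0))
```

Confining the perturbation to masked pixels is one plausible way a mask could reduce the footprint an explainability-based monitor sees. How the paper actually trades off stealth, efficiency, and explainability depends on details not given on this page.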

Code & Models

Repositories

FrankShi9/Attention-Mask-Attack
PyTorch · Official

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Electrostatic Discharge in Electronics · Integrated Circuits and Semiconductor Failure Analysis

Methods: Softmax · Attention Is All You Need