Megatron: Evasive Clean-Label Backdoor Attacks against Vision   Transformer

Xueluan Gong; Bowei Tian; Meng Xue; Shuike Li; Yanjiao Chen; Qian Wang

arXiv:2412.04776·cs.CV·December 9, 2024

Megatron: Evasive Clean-Label Backdoor Attacks against Vision Transformer

Xueluan Gong, Bowei Tian, Meng Xue, Shuike Li, Yanjiao Chen, Qian Wang

PDF

Open Access

TL;DR

This paper introduces Megatron, a novel clean-label backdoor attack on vision transformers that uses attention-based triggers, achieving high success rates and evading existing defenses without label manipulation.

Contribution

The paper proposes a new attention-guided backdoor attack method for vision transformers that does not require label manipulation, enhancing attack success and evasiveness.

Findings

01

Achieves over 90% attack success rate on multiple datasets.

02

Outperforms baseline methods in evading human inspection and defenses.

03

Effective even with trigger position shifts during testing.

Abstract

Vision transformers have achieved impressive performance in various vision-related tasks, but their vulnerability to backdoor attacks is under-explored. A handful of existing works focus on dirty-label attacks with wrongly-labeled poisoned training samples, which may fail if a benign model trainer corrects the labels. In this paper, we propose Megatron, an evasive clean-label backdoor attack against vision transformers, where the attacker injects the backdoor without manipulating the data-labeling process. To generate an effective trigger, we customize two loss terms based on the attention mechanism used in transformer networks, i.e., latent loss and attention diffusion loss. The latent loss aligns the last attention layer between triggered samples and clean samples of the target label. The attention diffusion loss emphasizes the attention diffusion area that encompasses the trigger. A…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Physical Unclonable Functions (PUFs) and Hardware Security · Advanced Malware Detection Techniques

MethodsSoftmax · Attention Is All You Need · Diffusion · Focus