Not All Prompts Are Secure: A Switchable Backdoor Attack Against   Pre-trained Vision Transformers

Sheng Yang; Jiawang Bai; Kuofeng Gao; Yong Yang; Yiming Li; Shu-tao; Xia

arXiv:2405.10612·cs.CV·May 20, 2024

Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision Transformers

Sheng Yang, Jiawang Bai, Kuofeng Gao, Yong Yang, Yiming Li, Shu-tao, Xia

PDF

Open Access 1 Repo

TL;DR

This paper introduces SWARM, a novel switchable backdoor attack on pre-trained vision transformers that uses a switch token to covertly activate malicious behavior, posing security risks in cloud API applications.

Contribution

The paper proposes SWARM, a new attack method that employs a switch token to enable stealthy, switchable backdoor attacks on vision transformers, which is difficult to detect and remove.

Findings

01

Achieves over 95% attack success rate.

02

Remains stealthy and hard to detect or remove.

03

Effective across diverse visual recognition tasks.

Abstract

Given the power of vision transformers, a new learning paradigm, pre-training and then prompting, makes it more efficient and effective to address downstream visual recognition tasks. In this paper, we identify a novel security threat towards such a paradigm from the perspective of backdoor attacks. Specifically, an extra prompt token, called the switch token in this work, can turn the backdoor mode on, i.e., converting a benign model into a backdoored one. Once under the backdoor mode, a specific trigger can force the model to predict a target class. It poses a severe risk to the users of cloud API, since the malicious behavior can not be activated and detected under the benign mode, thus making the attack very stealthy. To attack a pre-trained model, our proposed attack, named SWARM, learns a trigger and prompt tokens including a switch token. They are optimized with the clean loss…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

20000yshust/swarm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Memory and Neural Computing · Security and Verification in Computing