Transferable Adversarial Attacks on SAM and Its Downstream Models
Song Xia, Wenhan Yang, Yi Yu, Xun Lin, Henghui Ding and, Ling-Yu Duan, Xudong Jiang

TL;DR
This paper investigates the vulnerability of downstream models fine-tuned from the segment anything model (SAM) to transfer-based adversarial attacks, proposing a novel method that enhances attack transferability without needing access to downstream datasets.
Contribution
It introduces a universal meta-initialization algorithm and a gradient robust loss to improve transfer-based adversarial attacks on SAM and its downstream models, even without dataset access.
Findings
Effective adversarial attacks on SAM and downstream models demonstrated
Proposed methods outperform existing transfer-based attack techniques
Enhanced robustness and transferability of adversarial examples achieved
Abstract
The utilization of large foundational models has a dilemma: while fine-tuning downstream tasks from them holds promise for making use of the well-generalized knowledge in practical applications, their open accessibility also poses threats of adverse usage. This paper, for the first time, explores the feasibility of adversarial attacking various downstream models fine-tuned from the segment anything model (SAM), by solely utilizing the information from the open-sourced SAM. In contrast to prevailing transfer-based adversarial attacks, we demonstrate the existence of adversarial dangers even without accessing the downstream task and dataset to train a similar surrogate model. To enhance the effectiveness of the adversarial attack towards models fine-tuned on unknown datasets, we propose a universal meta-initialization (UMI) algorithm to extract the intrinsic vulnerability inherent in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Security and Verification in Computing · Cryptographic Implementations and Security
MethodsSegment Anything Model
