Transferable Adversarial Attacks on SAM and Its Downstream Models

Song Xia; Wenhan Yang; Yi Yu; Xun Lin; Henghui Ding and; Ling-Yu Duan; Xudong Jiang

arXiv:2410.20197·cs.LG·April 22, 2025

Transferable Adversarial Attacks on SAM and Its Downstream Models

Song Xia, Wenhan Yang, Yi Yu, Xun Lin, Henghui Ding and, Ling-Yu Duan, Xudong Jiang

PDF

Open Access 1 Repo

TL;DR

This paper investigates the vulnerability of downstream models fine-tuned from the segment anything model (SAM) to transfer-based adversarial attacks, proposing a novel method that enhances attack transferability without needing access to downstream datasets.

Contribution

It introduces a universal meta-initialization algorithm and a gradient robust loss to improve transfer-based adversarial attacks on SAM and its downstream models, even without dataset access.

Findings

01

Effective adversarial attacks on SAM and downstream models demonstrated

02

Proposed methods outperform existing transfer-based attack techniques

03

Enhanced robustness and transferability of adversarial examples achieved

Abstract

The utilization of large foundational models has a dilemma: while fine-tuning downstream tasks from them holds promise for making use of the well-generalized knowledge in practical applications, their open accessibility also poses threats of adverse usage. This paper, for the first time, explores the feasibility of adversarial attacking various downstream models fine-tuned from the segment anything model (SAM), by solely utilizing the information from the open-sourced SAM. In contrast to prevailing transfer-based adversarial attacks, we demonstrate the existence of adversarial dangers even without accessing the downstream task and dataset to train a similar surrogate model. To enhance the effectiveness of the adversarial attack towards models fine-tuned on unknown datasets, we propose a universal meta-initialization (UMI) algorithm to extract the intrinsic vulnerability inherent in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xiasong0501/grat
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Security and Verification in Computing · Cryptographic Implementations and Security

MethodsSegment Anything Model