Prompt-Driven Contrastive Learning for Transferable Adversarial Attacks

Hunmin Yang; Jongoh Jeong; Kuk-Jin Yoon

arXiv:2407.20657·cs.CV·March 14, 2025

Prompt-Driven Contrastive Learning for Transferable Adversarial Attacks

Hunmin Yang, Jongoh Jeong, Kuk-Jin Yoon

PDF

Open Access

TL;DR

This paper introduces PDCL-Attack, a novel prompt-driven generative adversarial attack leveraging CLIP's semantic understanding to improve transferability across diverse vision models and domains.

Contribution

It is the first to incorporate prompt learning into generative adversarial attacks for enhanced transferability across models and domains.

Findings

01

Outperforms state-of-the-art transfer attack methods.

02

Effective across various cross-domain and cross-model scenarios.

03

Leverages semantic prompts from class labels for attack guidance.

Abstract

Recent vision-language foundation models, such as CLIP, have demonstrated superior capabilities in learning representations that can be transferable across diverse range of downstream tasks and domains. With the emergence of such powerful models, it has become crucial to effectively leverage their capabilities in tackling challenging vision tasks. On the other hand, only a few works have focused on devising adversarial examples that transfer well to both unknown domains and model architectures. In this paper, we propose a novel transfer attack method called PDCL-Attack, which leverages the CLIP model to enhance the transferability of adversarial perturbations generated by a generative model-based attack framework. Specifically, we formulate an effective prompt-driven feature guidance by harnessing the semantic representation power of text, particularly from the ground-truth class labels…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications

MethodsContrastive Language-Image Pre-training