Adversarial Prompt Distillation for Vision-Language Models

Lin Luo; Xin Wang; Bojia Zi; Shihao Zhao; Xingjun Ma; Yu-Gang Jiang

arXiv:2411.15244 · cs.CV · September 17, 2025

Open Access

TL;DR

This paper introduces Adversarial Prompt Distillation (APD), a bimodal knowledge distillation framework that improves both the adversarial robustness and the clean accuracy of vision-language models by optimizing prompts for the visual and textual modalities together.

Contribution

The paper proposes a bimodal knowledge distillation approach that strengthens adversarial prompt tuning (APT) for VLMs and outperforms existing single-modal APT methods.

Findings

01. APD improves robustness against adversarial attacks.
02. APD achieves higher clean accuracy than prior methods.
03. Using a non-robust teacher can still enhance model robustness.

Abstract

Large pre-trained Vision-Language Models (VLMs) such as Contrastive Language-Image Pre-training (CLIP) have been shown to be susceptible to adversarial attacks, raising concerns about their deployment in safety-critical applications like autonomous driving and medical diagnosis. One promising approach for robustifying pre-trained VLMs is Adversarial Prompt Tuning (APT), which applies adversarial training during the process of prompt tuning. However, existing APT methods are mostly single-modal methods that design prompt(s) for only the visual or textual modality, limiting their effectiveness in either robustness or clean accuracy. In this work, we propose Adversarial Prompt Distillation (APD), a bimodal knowledge distillation framework that enhances APT by integrating it with multi-modal knowledge transfer. APD optimizes prompts for both visual and textual modalities while distilling…
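
The text above does not give the exact distillation objective, so the following is only a generic sketch of how a distillation-style robustness loss is commonly assembled, not the authors' formulation. It assumes a student that sees an adversarial input, a teacher that sees the clean input, and a hypothetical mixing weight `alpha`; the function names and the choice of KL(teacher || student) are illustrative assumptions.

```python
import math

def softmax(logits):
    """Convert raw logits to a probability distribution (numerically stable)."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(probs, label):
    """Negative log-likelihood of the true class."""
    return -math.log(probs[label])

def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions over the same classes."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)

def distillation_style_loss(student_adv_logits, teacher_clean_logits, label, alpha=0.5):
    """Hypothetical objective: supervised loss on the student's adversarial
    prediction plus a term pulling it toward the teacher's clean prediction.
    `alpha` is an assumed weight, not a value taken from the paper."""
    student = softmax(student_adv_logits)
    teacher = softmax(teacher_clean_logits)
    ce = cross_entropy(student, label)
    kl = kl_divergence(teacher, student)
    return (1.0 - alpha) * ce + alpha * kl

# Example: the student is fooled by the attack while the teacher is confident.
loss = distillation_style_loss([0.2, 1.5, 0.1], [3.0, 0.5, 0.2], label=0)
```

In a real setup the logits would come from CLIP image-text similarities computed with learnable visual and textual prompts, and the adversarial input would be produced by an attack such as PGD; both are omitted here to keep the sketch self-contained.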

Taxonomy

Topics: Multimodal Machine Learning Applications · Topic Modeling · Natural Language Processing Techniques

Methods: Knowledge Distillation · Contrastive Language-Image Pre-training