Adversarial Prompt Distillation for Vision-Language Models
Lin Luo, Xin Wang, Bojia Zi, Shihao Zhao, Xingjun Ma, Yu-Gang Jiang

TL;DR
This paper introduces Adversarial Prompt Distillation, a bimodal knowledge transfer framework that improves the robustness and accuracy of vision-language models against adversarial attacks by jointly optimizing prompts for both modalities.
Contribution
It proposes a novel bimodal knowledge distillation approach that enhances adversarial prompt tuning for VLMs, outperforming existing single-modal methods.
Findings
APD improves robustness against adversarial attacks.
APD achieves higher clean accuracy than prior methods.
Using a non-robust teacher can still enhance model robustness.
Abstract
Large pre-trained Vision-Language Models (VLMs) such as Contrastive Language-Image Pre-training (CLIP) have been shown to be susceptible to adversarial attacks, raising concerns about their deployment in safety-critical applications like autonomous driving and medical diagnosis. One promising approach for robustifying pre-trained VLMs is Adversarial Prompt Tuning (APT), which applies adversarial training during the process of prompt tuning. However, existing APT methods are mostly single-modal methods that design prompt(s) for only the visual or textual modality, limiting their effectiveness in either robustness or clean accuracy. In this work, we propose Adversarial Prompt Distillation (APD), a bimodal knowledge distillation framework that enhances APT by integrating it with multi-modal knowledge transfer. APD optimizes prompts for both visual and textual modalities while distilling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Natural Language Processing Techniques
MethodsKnowledge Distillation · Contrastive Language-Image Pre-training
