MoAPT: Mixture of Adversarial Prompt Tuning for Vision-Language Models

Shiji Zhao; Qihui Zhu; Shukun Xiong; Shouwei Ruan; Maoxun Yuan; Jialing Tao; Jiexi Liu; Ranjie Duan; Jie Zhang; Jie Zhang; Xingxing Wei

arXiv:2505.17509·cs.CV·December 19, 2025

MoAPT: Mixture of Adversarial Prompt Tuning for Vision-Language Models

Shiji Zhao, Qihui Zhu, Shukun Xiong, Shouwei Ruan, Maoxun Yuan, Jialing Tao, Jiexi Liu, Ranjie Duan, Jie Zhang, Jie Zhang, Xingxing Wei

PDF

Open Access

TL;DR

This paper introduces MoAPT, a novel adversarial prompt tuning method for vision-language models that improves robustness against various adversarial attacks by learning mixture prompts and using a conditional weight router.

Contribution

Proposes MoAPT, a mixture of adversarial prompt tuning with a conditional weight router, enhancing VLM robustness against diverse adversarial attacks.

Findings

01

MoAPT outperforms state-of-the-art methods in robustness across 11 datasets.

02

Increasing the number of learned prompts improves adversarial robustness.

03

Sample-specific prompt weighting enhances alignment with adversarial images.

Abstract

Large pre-trained Vision Language Models (VLMs) demonstrate excellent generalization capabilities but remain highly susceptible to adversarial examples, posing potential security risks. To improve the robustness of VLMs against adversarial examples, adversarial prompt tuning methods are proposed to align the text feature with the adversarial image feature without changing model parameters. However, when facing various adversarial attacks, a single learnable text prompt has insufficient generalization to align well with all adversarial image features, which ultimately results in overfitting. To address the above challenge, in this paper, we empirically find that increasing the number of learned prompts yields greater robustness improvements than simply extending the length of a single prompt. Building on this observation, we propose an adversarial tuning method named \textbf{Mixture of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Topic Modeling · Multimodal Machine Learning Applications

MethodsALIGN