Enhancing Adversarial Robustness of Vision-Language Models through Low-Rank Adaptation
Yuheng Ji, Yue Liu, Zhicheng Zhang, Zhao Zhang, Yuting Zhao, Xiaoshuai, Hao, Gang Zhou, Xingwei Zhang, Xiaolong Zheng

TL;DR
This paper introduces AdvLoRA, a low-rank adaptation method that enhances the adversarial robustness of vision-language models while reducing computational costs, addressing security vulnerabilities and resource inefficiencies.
Contribution
We propose a novel, parameter-efficient adversarial adaptation technique called AdvLoRA, leveraging low-rank properties, parameter clustering, and adaptive updates to improve robustness and efficiency of VLMs.
Findings
AdvLoRA significantly improves adversarial robustness of VLMs.
The method reduces computational costs compared to traditional techniques.
Extensive experiments validate the effectiveness of AdvLoRA.
Abstract
Vision-Language Models (VLMs) play a crucial role in the advancement of Artificial General Intelligence (AGI). As AGI rapidly evolves, addressing security concerns has emerged as one of the most significant challenges for VLMs. In this paper, we present extensive experiments that expose the vulnerabilities of conventional adaptation methods for VLMs, highlighting significant security risks. Moreover, as VLMs grow in size, the application of traditional adversarial adaptation techniques incurs substantial computational costs. To address these issues, we propose a parameter-efficient adversarial adaptation method called \textbf{\textit{AdvLoRA}} based on Low-Rank Adaptation. We investigate and reveal the inherent low-rank properties involved in adversarial adaptation for VLMs. Different from LoRA, we enhance the efficiency and robustness of adversarial adaptation by introducing a novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Natural Language Processing Techniques
