Enhancing Adversarial Robustness of Vision-Language Models through   Low-Rank Adaptation

Yuheng Ji; Yue Liu; Zhicheng Zhang; Zhao Zhang; Yuting Zhao; Xiaoshuai; Hao; Gang Zhou; Xingwei Zhang; Xiaolong Zheng

arXiv:2404.13425·cs.CV·February 21, 2025·1 cites

Enhancing Adversarial Robustness of Vision-Language Models through Low-Rank Adaptation

Yuheng Ji, Yue Liu, Zhicheng Zhang, Zhao Zhang, Yuting Zhao, Xiaoshuai, Hao, Gang Zhou, Xingwei Zhang, Xiaolong Zheng

PDF

Open Access

TL;DR

This paper introduces AdvLoRA, a low-rank adaptation method that enhances the adversarial robustness of vision-language models while reducing computational costs, addressing security vulnerabilities and resource inefficiencies.

Contribution

We propose a novel, parameter-efficient adversarial adaptation technique called AdvLoRA, leveraging low-rank properties, parameter clustering, and adaptive updates to improve robustness and efficiency of VLMs.

Findings

01

AdvLoRA significantly improves adversarial robustness of VLMs.

02

The method reduces computational costs compared to traditional techniques.

03

Extensive experiments validate the effectiveness of AdvLoRA.

Abstract

Vision-Language Models (VLMs) play a crucial role in the advancement of Artificial General Intelligence (AGI). As AGI rapidly evolves, addressing security concerns has emerged as one of the most significant challenges for VLMs. In this paper, we present extensive experiments that expose the vulnerabilities of conventional adaptation methods for VLMs, highlighting significant security risks. Moreover, as VLMs grow in size, the application of traditional adversarial adaptation techniques incurs substantial computational costs. To address these issues, we propose a parameter-efficient adversarial adaptation method called \textbf{\textit{AdvLoRA}} based on Low-Rank Adaptation. We investigate and reveal the inherent low-rank properties involved in adversarial adaptation for VLMs. Different from LoRA, we enhance the efficiency and robustness of adversarial adaptation by introducing a novel…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Natural Language Processing Techniques