A Hybrid Defense Strategy for Boosting Adversarial Robustness in   Vision-Language Models

Yuhan Liang; Yijun Li; Yumeng Niu; Qianhe Shen; Hangyu Liu

arXiv:2410.14911·cs.CV·October 22, 2024

A Hybrid Defense Strategy for Boosting Adversarial Robustness in Vision-Language Models

Yuhan Liang, Yijun Li, Yumeng Niu, Qianhe Shen, Hangyu Liu

PDF

Open Access

TL;DR

This paper introduces a hybrid adversarial training framework that combines multiple attack strategies and machine learning techniques to improve the robustness of Vision-Language Models like CLIP against diverse adversarial attacks, with significant experimental validation.

Contribution

The paper presents a novel adversarial training approach that integrates various attack methods and machine learning models to enhance VLM robustness beyond existing techniques.

Findings

01

Significantly improved robustness of CLIP against adversarial attacks.

02

Achieved 43.5% accuracy on adversarially perturbed images, outperforming baseline.

03

High classification accuracy of 98% with neural networks and 85.26% success rate with XGBoost.

Abstract

The robustness of Vision-Language Models (VLMs) such as CLIP is critical for their deployment in safety-critical applications like autonomous driving, healthcare diagnostics, and security systems, where accurate interpretation of visual and textual data is essential. However, these models are highly susceptible to adversarial attacks, which can severely compromise their performance and reliability in real-world scenarios. Previous methods have primarily focused on improving robustness through adversarial training and generating adversarial examples using models like FGSM, AutoAttack, and DeepFool. However, these approaches often rely on strong assumptions, such as fixed perturbation norms or predefined attack patterns, and involve high computational complexity, making them challenging to implement in practical settings. In this paper, we propose a novel adversarial training framework…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning

MethodsContrastive Language-Image Pre-training