Adversarial Attacks to Multi-Modal Models

Zhihao Dou; Xin Hu; Haibo Yang; Zhuqing Liu; and Minghong Fang

arXiv:2409.06793·cs.CR·September 25, 2024

Adversarial Attacks to Multi-Modal Models

Zhihao Dou, Xin Hu, Haibo Yang, Zhuqing Liu, and Minghong Fang

PDF

Open Access

TL;DR

This paper introduces CrossFire, a novel attack method that effectively manipulates multi-modal models by transforming targeted inputs and optimizing perturbations, exposing vulnerabilities in current defenses across multiple datasets.

Contribution

We propose CrossFire, a new attack approach that aligns targeted inputs with original modalities and outperforms existing methods in deceiving multi-modal models.

Findings

01

CrossFire significantly outperforms existing attacks.

02

Current defenses are largely ineffective against CrossFire.

03

Experiments conducted on six benchmark datasets.

Abstract

Multi-modal models have gained significant attention due to their powerful capabilities. These models effectively align embeddings across diverse data modalities, showcasing superior performance in downstream tasks compared to their unimodal counterparts. Recent study showed that the attacker can manipulate an image or audio file by altering it in such a way that its embedding matches that of an attacker-chosen targeted input, thereby deceiving downstream models. However, this method often underperforms due to inherent disparities in data from different modalities. In this paper, we introduce CrossFire, an innovative approach to attack multi-modal models. CrossFire begins by transforming the targeted input chosen by the attacker into a format that matches the modality of the original image or audio file. We then formulate our attack as an optimization problem, aiming to minimize the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning

MethodsSoftmax · Attention Is All You Need · ALIGN