Adversarial Attacks to Multi-Modal Models
Zhihao Dou, Xin Hu, Haibo Yang, Zhuqing Liu, and Minghong Fang

TL;DR
This paper introduces CrossFire, a novel attack method that effectively manipulates multi-modal models by transforming targeted inputs and optimizing perturbations, exposing vulnerabilities in current defenses across multiple datasets.
Contribution
We propose CrossFire, a new attack approach that aligns targeted inputs with original modalities and outperforms existing methods in deceiving multi-modal models.
Findings
CrossFire significantly outperforms existing attacks.
Current defenses are largely ineffective against CrossFire.
Experiments conducted on six benchmark datasets.
Abstract
Multi-modal models have gained significant attention due to their powerful capabilities. These models effectively align embeddings across diverse data modalities, showcasing superior performance in downstream tasks compared to their unimodal counterparts. Recent study showed that the attacker can manipulate an image or audio file by altering it in such a way that its embedding matches that of an attacker-chosen targeted input, thereby deceiving downstream models. However, this method often underperforms due to inherent disparities in data from different modalities. In this paper, we introduce CrossFire, an innovative approach to attack multi-modal models. CrossFire begins by transforming the targeted input chosen by the attacker into a format that matches the modality of the original image or audio file. We then formulate our attack as an optimization problem, aiming to minimize the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
MethodsSoftmax · Attention Is All You Need · ALIGN
