Seeing is Deceiving: Exploitation of Visual Pathways in Multi-Modal Language Models
Pete Janowczyk, Linda Laurier, Ave Giulietta, Arlo Octavia, Meade, Cleti

TL;DR
This paper reviews how visual inputs in multi-modal language models can be exploited through various attack strategies, highlighting security risks and discussing defense methods to improve robustness.
Contribution
It categorizes attack strategies on visual pathways in MLLMs and evaluates current defenses, proposing directions for enhancing security in multi-modal AI systems.
Findings
Visual inputs can be manipulated with subtle tweaks to deceive models.
Advanced cross-modal attacks like VLATTACK can mislead models while remaining undetectable.
Current defenses have limitations, and new adaptive methods are needed for better security.
Abstract
Multi-Modal Language Models (MLLMs) have transformed artificial intelligence by combining visual and text data, making applications like image captioning, visual question answering, and multi-modal content creation possible. This ability to understand and work with complex information has made MLLMs useful in areas such as healthcare, autonomous systems, and digital content. However, integrating multiple types of data also creates security risks. Attackers can manipulate either the visual or text inputs, or both, to make the model produce unintended or even harmful responses. This paper reviews how visual inputs in MLLMs can be exploited by various attack strategies. We break down these attacks into categories: simple visual tweaks and cross-modal manipulations, as well as advanced strategies like VLATTACK, HADES, and Collaborative Multimodal Adversarial Attack (Co-Attack). These…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications
