Technical Report for ICML 2024 TiFA Workshop MLLM Attack Challenge:   Suffix Injection and Projected Gradient Descent Can Easily Fool An MLLM

Yangyang Guo; Ziwei Xu; Xilie Xu; YongKang Wong; Liqiang; Nie; Mohan Kankanhalli

arXiv:2412.15614·cs.CR·December 23, 2024

Technical Report for ICML 2024 TiFA Workshop MLLM Attack Challenge: Suffix Injection and Projected Gradient Descent Can Easily Fool An MLLM

Yangyang Guo, Ziwei Xu, Xilie Xu, YongKang Wong, Liqiang, Nie, Mohan Kankanhalli

PDF

Open Access

TL;DR

This technical report presents a top-ranked attack method for MLLMs using suffix injection and PGD to fool the LLaVA 1.5 model, demonstrating vulnerabilities in multimodal models.

Contribution

The paper introduces a novel combination of suffix injection and PGD for effective adversarial attacks on MLLMs, advancing attack strategies in multimodal AI.

Findings

01

Suffix injection successfully misleads MLLMs.

02

PGD adds imperceptible perturbations to images.

03

Combined approach achieves high attack success rate.

Abstract

This technical report introduces our top-ranked solution that employs two approaches, \ie suffix injection and projected gradient descent (PGD) , to address the TiFA workshop MLLM attack challenge. Specifically, we first append the text from an incorrectly labeled option (pseudo-labeled) to the original query as a suffix. Using this modified query, our second approach applies the PGD method to add imperceptible perturbations to the image. Combining these two techniques enables successful attacks on the LLaVA 1.5 model.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Cryptographic Implementations and Security