Divide and Conquer: A Hybrid Strategy Defeats Multimodal Large Language Models

Yanxu Mao; Peipei Liu; Tiehan Cui; Zhaoteng Yan; Congying Liu; Datao You

arXiv:2412.16555 · cs.CL · May 30, 2025

Open Access

TL;DR

This paper introduces JMLLM, a multimodal jailbreaking approach that effectively exploits vulnerabilities in large language models across text, visual, and auditory modalities, supported by a new dataset and extensive experiments.

Contribution

It presents a novel multimodal jailbreaking method and a comprehensive dataset, advancing security testing for large language models across multiple modalities.

Findings

01. High attack success rates across 13 LLMs
02. Significant reduction in attack time overhead
03. Enhanced coverage of jailbreak modalities

Abstract

Large language models (LLMs) are widely applied in various fields of society due to their powerful reasoning, understanding, and generation capabilities. However, the security issues associated with these models are becoming increasingly severe. Jailbreaking attacks, as an important method for detecting vulnerabilities in LLMs, have been explored by researchers who attempt to induce these models to generate harmful content through various attack methods. Nevertheless, existing jailbreaking methods face numerous limitations, such as excessive query counts, limited coverage of jailbreak modalities, low attack success rates, and simplistic evaluation methods. To overcome these constraints, this paper proposes a multimodal jailbreaking method: JMLLM. This method integrates multiple strategies to perform comprehensive jailbreak attacks across text, visual, and auditory modalities.…

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling