Mosaic: Multimodal Jailbreak against Closed-Source VLMs via Multi-View Ensemble Optimization

Yuqin Lan; Gen Li; Yuanze Hu; Weihao Shen; Zhaoxin Fan; Faguo Wu; Xiao Zhang; Laurence T. Yang; Zhiming Zheng

arXiv:2604.09253·cs.CV·April 13, 2026

Mosaic: Multimodal Jailbreak against Closed-Source VLMs via Multi-View Ensemble Optimization

Yuqin Lan, Gen Li, Yuanze Hu, Weihao Shen, Zhaoxin Fan, Faguo Wu, Xiao Zhang, Laurence T. Yang, Zhiming Zheng

PDF

TL;DR

Mosaic is a novel ensemble optimization framework that enhances multimodal jailbreak attacks on closed-source VLMs by reducing surrogate dependency through multi-view and multi-model guidance.

Contribution

It introduces Mosaic, a multi-view ensemble approach that improves attack success on closed-source VLMs by mitigating surrogate dependency and overfitting.

Findings

01

Achieves state-of-the-art attack success rate.

02

Demonstrates robustness across heterogeneous surrogate-target settings.

03

Outperforms existing attack methods in toxicity and success metrics.

Abstract

Vision-Language Models (VLMs) are powerful but remain vulnerable to multimodal jailbreak attacks. Existing attacks mainly rely on either explicit visual prompt attacks or gradient-based adversarial optimization. While the former is easier to detect, the latter produces subtle perturbations that are less perceptible, but is usually optimized and evaluated under homogeneous open-source surrogate-target settings, leaving its effectiveness on commercial closed-source VLMs under heterogeneous settings unclear. To examine this issue, we study different surrogate-target settings and observe a consistent gap between homogeneous and heterogeneous settings, a phenomenon we term surrogate dependency. Motivated by this finding, we propose Mosaic, a Multi-view ensemble optimization framework for multimodal jailbreak against closed-source VLMs, which alleviates surrogate dependency under…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.