RunawayEvil: Jailbreaking the Image-to-Video Generative Models

Songping Wang; Rufan Qian; Yueming Lyu; Qinglong Liu; Linzhuang Zou; Jie Qin; Songhua Liu; Caifeng Shan

arXiv:2512.06674 · cs.CV · December 9, 2025

TL;DR

RunawayEvil is a self-evolving multimodal jailbreak framework that raises attack success rates on image-to-video (I2V) models, exposing security vulnerabilities in current systems.

Contribution

This paper presents the first adaptive, reinforcement learning-based multimodal jailbreak framework for I2V models, enabling continuous attack strategy evolution without human input.

Findings

1. Achieves state-of-the-art attack success rates on commercial I2V models.
2. Outperforms existing methods by 58.5 to 79 percent on COCO2017.
3. Demonstrates the effectiveness of self-evolving attack strategies.

Abstract

Image-to-Video (I2V) generation synthesizes dynamic visual content from image and text inputs, providing significant creative control. However, the security of such multimodal systems, particularly their vulnerability to jailbreak attacks, remains critically underexplored. To bridge this gap, we propose RunawayEvil, the first multimodal jailbreak framework for I2V models with dynamic evolutionary capability. Built on a "Strategy-Tactic-Action" paradigm, our framework exhibits self-amplifying attack through three core components: (1) Strategy-Aware Command Unit that enables the attack to self-evolve its strategies through reinforcement learning-driven strategy customization and LLM-based strategy exploration; (2) Multimodal Tactical Planning Unit that generates coordinated text jailbreak instructions and image tampering guidelines based on the selected strategies; (3) Tactical Action…

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Security and Verification in Computing