Jailbreaking Large Vision Language Models in Intelligent Transportation Systems

Badhan Chandra Das; Md Tasnim Jawad; Md Jueal Mia; M. Hadi Amini; Yanzhao Wu

arXiv:2511.13892·cs.AI·November 19, 2025

Jailbreaking Large Vision Language Models in Intelligent Transportation Systems

Badhan Chandra Das, Md Tasnim Jawad, Md Jueal Mia, M. Hadi Amini, Yanzhao Wu

PDF

Open Access

TL;DR

This paper reveals vulnerabilities of large vision language models in transportation systems by developing new jailbreaking attacks and proposing a defense, highlighting significant security risks in multimodal AI applications.

Contribution

It introduces a novel jailbreaking attack exploiting image typography and multi-turn prompts, along with a multi-layered filtering defense for LVLMs in ITS.

Findings

01

Jailbreaking attacks can bypass existing defenses in LVLMs.

02

Proposed attack outperforms existing methods in fooling models.

03

Defense reduces inappropriate responses significantly.

Abstract

Large Vision Language Models (LVLMs) demonstrate strong capabilities in multimodal reasoning and many real-world applications, such as visual question answering. However, LVLMs are highly vulnerable to jailbreaking attacks. This paper systematically analyzes the vulnerabilities of LVLMs integrated in Intelligent Transportation Systems (ITS) under carefully crafted jailbreaking attacks. First, we carefully construct a dataset with harmful queries relevant to transportation, following OpenAI's prohibited categories to which the LVLMs should not respond. Second, we introduce a novel jailbreaking attack that exploits the vulnerabilities of LVLMs through image typography manipulation and multi-turn prompting. Third, we propose a multi-layered response filtering defense technique to prevent the model from generating inappropriate responses. We perform extensive experiments with the proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications