Natural Reflection Backdoor Attack on Vision Language Model for Autonomous Driving
Ming Liu, Siyuan Liang, Koushik Howlader, Liwen Wang, Dacheng Tao, Wensheng Zhang

TL;DR
This paper introduces a natural reflection backdoor attack on vision-language models used in autonomous driving, causing delays in responses when specific visual triggers are present, thus threatening system safety.
Contribution
It presents a novel backdoor attack method using natural reflection patterns and irrelevant textual prefixes to induce latency in VLMs for autonomous driving.
Findings
Models show normal accuracy on clean data
Triggered responses cause significant inference delays
Attack exploits real-time constraints in autonomous systems
Abstract
Vision-Language Models (VLMs) have been integrated into autonomous driving systems to enhance reasoning capabilities through tasks such as Visual Question Answering (VQA). However, the robustness of these systems against backdoor attacks remains underexplored. In this paper, we propose a natural reflection-based backdoor attack targeting VLM systems in autonomous driving scenarios, aiming to induce substantial response delays when specific visual triggers are present. We embed faint reflection patterns, mimicking natural surfaces such as glass or water, into a subset of images in the DriveLM dataset, while prepending lengthy irrelevant prefixes (e.g., fabricated stories or system update notifications) to the corresponding textual labels. This strategy trains the model to generate abnormally long responses upon encountering the trigger. We fine-tune two state-of-the-art VLMs, Qwen2-VL…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications
