Black-Box Adversarial Attack on Vision Language Models for Autonomous Driving
Lu Wang, Tianyuan Zhang, Yang Qu, Siyuan Liang, Yuwei Chen, Aishan, Liu, Xianglong Liu, Dacheng Tao

TL;DR
This paper introduces a novel black-box adversarial attack method called Cascading Adversarial Disruption (CAD) targeting vision-language models in autonomous driving, significantly improving attack success and demonstrating real-world effectiveness.
Contribution
We propose CAD, a new black-box attack framework for VLMs in autonomous driving, addressing dynamic scenarios and reasoning chains, and provide a large adversarial dataset for future research.
Findings
CAD achieves +13.43% attack success rate over existing methods.
Real-world attacks reduce route completion by 61.11% and cause vehicle crashes.
The CADA dataset contains 18,808 adversarial visual-question-answer pairs.
Abstract
Vision-language models (VLMs) have significantly advanced autonomous driving (AD) by enhancing reasoning capabilities; however, these models remain highly susceptible to adversarial attacks. While existing research has explored white-box attacks to some extent, the more practical and challenging black-box scenarios remain largely underexplored due to their inherent difficulty. In this paper, we take the first step toward designing black-box adversarial attacks specifically targeting VLMs in AD. We identify two key challenges for achieving effective black-box attacks in this context: the effectiveness across driving reasoning chains in AD systems and the dynamic nature of driving scenarios. To address this, we propose Cascading Adversarial Disruption (CAD). It first introduces Decision Chain Disruption, which targets low-level reasoning breakdown by generating and injecting deceptive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
