Black-Box Adversarial Attack on Vision Language Models for Autonomous   Driving

Lu Wang; Tianyuan Zhang; Yang Qu; Siyuan Liang; Yuwei Chen; Aishan; Liu; Xianglong Liu; Dacheng Tao

arXiv:2501.13563·cs.CV·January 24, 2025

Black-Box Adversarial Attack on Vision Language Models for Autonomous Driving

Lu Wang, Tianyuan Zhang, Yang Qu, Siyuan Liang, Yuwei Chen, Aishan, Liu, Xianglong Liu, Dacheng Tao

PDF

Open Access

TL;DR

This paper introduces a novel black-box adversarial attack method called Cascading Adversarial Disruption (CAD) targeting vision-language models in autonomous driving, significantly improving attack success and demonstrating real-world effectiveness.

Contribution

We propose CAD, a new black-box attack framework for VLMs in autonomous driving, addressing dynamic scenarios and reasoning chains, and provide a large adversarial dataset for future research.

Findings

01

CAD achieves +13.43% attack success rate over existing methods.

02

Real-world attacks reduce route completion by 61.11% and cause vehicle crashes.

03

The CADA dataset contains 18,808 adversarial visual-question-answer pairs.

Abstract

Vision-language models (VLMs) have significantly advanced autonomous driving (AD) by enhancing reasoning capabilities; however, these models remain highly susceptible to adversarial attacks. While existing research has explored white-box attacks to some extent, the more practical and challenging black-box scenarios remain largely underexplored due to their inherent difficulty. In this paper, we take the first step toward designing black-box adversarial attacks specifically targeting VLMs in AD. We identify two key challenges for achieving effective black-box attacks in this context: the effectiveness across driving reasoning chains in AD systems and the dynamic nature of driving scenarios. To address this, we propose Cascading Adversarial Disruption (CAD). It first introduces Decision Chain Disruption, which targets low-level reasoning breakdown by generating and injecting deceptive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning