Adversarial Flow Matching for Imperceptible Attacks on End-to-End Autonomous Driving
Xinyu Zeng, Xiangkun He, Lei Tao, Chen Lv, Hong Cheng

TL;DR
This paper introduces Adversarial Flow Matching, a gray-box attack method exploiting Transformer vulnerabilities in end-to-end autonomous driving models to generate imperceptible adversarial examples efficiently.
Contribution
The paper presents AFM, a novel one-step adversarial attack framework that effectively targets Transformer-based autonomous driving models with high transferability and imperceptibility.
Findings
AFM significantly degrades AD model performance across scenarios.
The generated adversarial examples are visually imperceptible.
AFM demonstrates strong transferability to different models.
Abstract
Autonomous driving (AD) is evolving towards end-to-end (E2E) frameworks through two primary paradigms: monolithic models exemplified by Vision-Language-Action (VLA), and specialized modular architectures. Despite their divergent designs, both paradigms increasingly rely on Transformer backbones for complex reasoning, potentially causing a shared vulnerability: visually imperceptible perturbations can manipulate E2E AD models into hazardous maneuvers by targeting the Transformer module. Most existing adversarial attack approaches against AD systems operate under white-box or black-box settings; yet, they typically necessitate full model transparency, or suffer from either prohibitive query latency or limited attack transferability. In this paper, we propose Adversarial Flow Matching (AFM), a novel gray-box attack framework that exploits Transformer structural vulnerabilities in E2E AD…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
