Targeted Attack on Deep RL-based Autonomous Driving with Learned Visual Patterns
Prasanth Buddareddygari, Travis Zhang, Yezhou Yang, Yi Ren

TL;DR
This paper explores a novel targeted attack method on deep reinforcement learning-based autonomous driving systems by using learned visual patterns on physical objects, demonstrating its feasibility and effectiveness in hijacking control policies.
Contribution
It introduces a new threat model combining physical object manipulation with learned visual patterns to attack autonomous driving policies, showing practical feasibility.
Findings
Pre-trained policies can be hijacked within a time window.
The attack remains effective across different driving scenarios.
There is a tradeoff between attack strength and success rate.
Abstract
Recent studies demonstrated the vulnerability of control policies learned through deep reinforcement learning against adversarial attacks, raising concerns about the application of such models to risk-sensitive tasks such as autonomous driving. Threat models for these demonstrations are limited to (1) targeted attacks through real-time manipulation of the agent's observation, and (2) untargeted attacks through manipulation of the physical environment. The former assumes full access to the agent's states/observations at all times, while the latter has no control over attack outcomes. This paper investigates the feasibility of targeted attacks through visually learned patterns placed on physical objects in the environment, a threat model that combines the practicality and effectiveness of the existing ones. Through analysis, we demonstrate that a pre-trained policy can be hijacked within…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications
