Stackelberg Driver Model for Continual Policy Improvement in Scenario-Based Closed-Loop Autonomous Driving
Haoyi Niu, Qimao Chen, Yingyue Li, Yi Zhang, Jianming Hu

TL;DR
This paper introduces the Stackelberg Driver Model (SDM), a hierarchical game-based approach that enables continual policy improvement in autonomous driving by iteratively challenging AVs with adversarial background vehicles.
Contribution
The paper presents a novel leader-follower game framework that models vehicle interactions to facilitate ongoing autonomous vehicle policy refinement using adversarial scenario generation.
Findings
SDM outperforms baseline methods in complex scenarios
The approach enables continuous AV policy enhancement
Generated scenarios become increasingly challenging over iterations
Abstract
The deployment of autonomous vehicles (AVs) has faced hurdles due to the dominance of rare but critical corner cases within the long-tail distribution of driving scenarios, which negatively affects their overall performance. To address this challenge, adversarial generation methods have emerged as a class of efficient approaches to synthesize safety-critical scenarios for AV testing. However, these generated scenarios are often underutilized for AV training, resulting in the potential for continual AV policy improvement remaining untapped, along with a deficiency in the closed-loop design needed to achieve it. Therefore, we tailor the Stackelberg Driver Model (SDM) to accurately characterize the hierarchical nature of vehicle interaction dynamics, facilitating iterative improvement by engaging background vehicles (BVs) and AV in a sequential game-like interaction paradigm. With AV…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Adversarial Robustness in Machine Learning · Real-time simulation and control systems
