Stackelberg Driver Model for Continual Policy Improvement in Scenario-Based Closed-Loop Autonomous Driving

Haoyi Niu; Qimao Chen; Yingyue Li; Yi Zhang; Jianming Hu

arXiv:2309.14235 · cs.LG · December 6, 2023 · 1 citation

TL;DR

This paper introduces the Stackelberg Driver Model (SDM), a hierarchical game-based approach that enables continual policy improvement in autonomous driving by iteratively challenging AVs with adversarial background vehicles.

Contribution

The paper presents a novel leader-follower game framework that models vehicle interactions to facilitate ongoing autonomous vehicle policy refinement using adversarial scenario generation.

Findings

01. SDM outperforms baseline methods in complex scenarios
02. The approach enables continuous AV policy enhancement
03. Generated scenarios become increasingly challenging over iterations

Abstract

The deployment of autonomous vehicles (AVs) has faced hurdles due to the dominance of rare but critical corner cases within the long-tail distribution of driving scenarios, which negatively affects their overall performance. To address this challenge, adversarial generation methods have emerged as a class of efficient approaches to synthesize safety-critical scenarios for AV testing. However, these generated scenarios are often underutilized for AV training, resulting in the potential for continual AV policy improvement remaining untapped, along with a deficiency in the closed-loop design needed to achieve it. Therefore, we tailor the Stackelberg Driver Model (SDM) to accurately characterize the hierarchical nature of vehicle interaction dynamics, facilitating iterative improvement by engaging background vehicles (BVs) and AV in a sequential game-like interaction paradigm. With AV…
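
As a rough illustration of the sequential (leader-follower) interaction the abstract refers to, the sketch below solves a tiny finite Stackelberg game by enumeration. It is not the SDM algorithm from the paper: the payoff tables and the function name `stackelberg_equilibrium` are hypothetical, and no claim is made here about which vehicle plays the leader.

```python
from typing import List, Tuple

Payoffs = List[List[float]]  # payoffs[a][b]: leader action a, follower action b


def stackelberg_equilibrium(leader_payoff: Payoffs,
                            follower_payoff: Payoffs) -> Tuple[int, int]:
    """Enumerate leader actions, assuming the follower best-responds to each.

    Ties are broken in favor of the lowest action index (an assumption of
    this sketch, not something taken from the paper).
    """
    best_value = None
    best_pair = (0, 0)
    for a, follower_row in enumerate(follower_payoff):
        # The follower observes the leader's committed action a and
        # picks the response that maximizes its own payoff.
        b = max(range(len(follower_row)), key=lambda j: follower_row[j])
        value = leader_payoff[a][b]
        # The leader keeps the commitment with the best anticipated outcome.
        if best_value is None or value > best_value:
            best_value = value
            best_pair = (a, b)
    return best_pair


# Hypothetical 2x2 payoff tables, chosen only to show the effect of commitment.
leader_payoff = [[2, 4],
                 [1, 3]]
follower_payoff = [[1, 0],
                   [0, 1]]

result = stackelberg_equilibrium(leader_payoff, follower_payoff)
print(result)
```

In this toy table the leader's action 0 pays more against either follower action, yet committing to action 1 yields 3 rather than 2 once the follower's reaction is taken into account. That ordering effect is what a simultaneous-move model would miss.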

Code & Models

Repositories

BlueCat-de/SDM (PyTorch, official)

Taxonomy

Topics: Autonomous Vehicle Technology and Safety · Adversarial Robustness in Machine Learning · Real-time simulation and control systems