Improve Generalization of Driving Policy at Signalized Intersections with Adversarial Learning
Yangang Ren, Guojian Zhan, Liye Tang, Shengbo Eben Li, Jianhua Jiang, and Jingliang Duan

TL;DR
This paper presents an adversarial learning framework to improve the robustness and safety of autonomous driving policies at signalized intersections, effectively handling environmental uncertainties and diverse traffic behaviors.
Contribution
It introduces a novel adversarial policy gradient method combined with a static path planner and optimal control to enhance driving policy robustness at complex intersections.
Findings
Enhanced policy robustness against uncertainties
Improved safety and efficiency in intersection navigation
Large-margin resistance to abnormal behaviors
Abstract
Intersections are quite challenging among various driving scenes wherein the interaction of signal lights and distinct traffic actors poses great difficulty to learn a wise and robust driving policy. Current research rarely considers the diversity of intersections and stochastic behaviors of traffic participants. For practical applications, the randomness usually leads to some devastating events, which should be the focus of autonomous driving. This paper introduces an adversarial learning paradigm to boost the intelligence and robustness of driving policy for signalized intersections with dense traffic flow. Firstly, we design a static path planner which is capable of generating trackable candidate paths for multiple intersections with diversified topology. Next, a constrained optimal control problem (COCP) is built based on these candidate paths wherein the bounded uncertainty of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Adversarial Robustness in Machine Learning · Traffic control and management
