End-to-End Urban Driving by Imitating a Reinforcement Learning Coach
Zhejun Zhang, Alexander Liniger, Dengxin Dai, Fisher Yu, Luc Van Gool

TL;DR
This paper introduces a reinforcement learning expert that provides superior supervision signals for end-to-end urban driving, achieving high success rates and better generalization in complex driving scenarios.
Contribution
It presents a reinforcement learning-based expert that outperforms rule-based methods and enhances imitation learning for autonomous urban driving.
Findings
Reinforcement learning expert sets new CARLA performance upper-bound.
Supervised agent achieves 78% success rate in diverse conditions.
State-of-the-art results on CARLA LeaderBoard and NoCrash-dense benchmark.
Abstract
End-to-end approaches to autonomous driving commonly rely on expert demonstrations. Although humans are good drivers, they are not good coaches for end-to-end algorithms that demand dense on-policy supervision. On the contrary, automated experts that leverage privileged information can efficiently generate large scale on-policy and off-policy demonstrations. However, existing automated experts for urban driving make heavy use of hand-crafted rules and perform suboptimally even on driving simulators, where ground-truth information is available. To address these issues, we train a reinforcement learning expert that maps bird's-eye view images to continuous low-level actions. While setting a new performance upper-bound on CARLA, our expert is also a better coach that provides informative supervision signals for imitation learning agents to learn from. Supervised by our reinforcement…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Advanced Neural Network Applications · Reinforcement Learning in Robotics
MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator
