End-to-End Urban Driving by Imitating a Reinforcement Learning Coach

Zhejun Zhang; Alexander Liniger; Dengxin Dai; Fisher Yu; Luc Van Gool

arXiv:2108.08265·cs.CV·October 6, 2021

End-to-End Urban Driving by Imitating a Reinforcement Learning Coach

Zhejun Zhang, Alexander Liniger, Dengxin Dai, Fisher Yu, Luc Van Gool

PDF

Open Access 3 Repos

TL;DR

This paper introduces a reinforcement learning expert that provides superior supervision signals for end-to-end urban driving, achieving high success rates and better generalization in complex driving scenarios.

Contribution

It presents a reinforcement learning-based expert that outperforms rule-based methods and enhances imitation learning for autonomous urban driving.

Findings

01

Reinforcement learning expert sets new CARLA performance upper-bound.

02

Supervised agent achieves 78% success rate in diverse conditions.

03

State-of-the-art results on CARLA LeaderBoard and NoCrash-dense benchmark.

Abstract

End-to-end approaches to autonomous driving commonly rely on expert demonstrations. Although humans are good drivers, they are not good coaches for end-to-end algorithms that demand dense on-policy supervision. On the contrary, automated experts that leverage privileged information can efficiently generate large scale on-policy and off-policy demonstrations. However, existing automated experts for urban driving make heavy use of hand-crafted rules and perform suboptimally even on driving simulators, where ground-truth information is available. To address these issues, we train a reinforcement learning expert that maps bird's-eye view images to continuous low-level actions. While setting a new performance upper-bound on CARLA, our expert is also a better coach that provides informative supervision signals for imitation learning agents to learn from. Supervised by our reinforcement…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutonomous Vehicle Technology and Safety · Advanced Neural Network Applications · Reinforcement Learning in Robotics

MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator