End-to-End Race Driving with Deep Reinforcement Learning

Maximilian Jaritz; Raoul de Charette; Marin Toromanoff; Etienne Perot; Fawzi Nashashibi

arXiv:1807.02371 · cs.CV · September 3, 2018

TL;DR

This paper introduces a deep reinforcement learning approach for end-to-end race driving that learns directly from RGB images, achieving robust control and generalization in a realistic rally simulation.

Contribution

It proposes new reward and learning strategies within an A3C framework for end-to-end driving without mediated perception, demonstrating improved convergence and robustness.

Findings

01. Faster convergence with new reward strategies

02. Robust driving across diverse tracks and conditions

03. Some domain adaptation to real image sequences

Abstract

We present research using the latest reinforcement learning algorithm for end-to-end driving without any mediated perception (object recognition, scene understanding). Together, the newly proposed reward and learning strategies lead to faster convergence and more robust driving using only RGB images from a forward-facing camera. An Asynchronous Advantage Actor-Critic (A3C) framework is used to learn car control in a physically and graphically realistic rally game, with the agents evolving simultaneously on tracks with a variety of road structures (turns, hills), graphics (seasons, location) and physics (road adherence). A thorough evaluation is conducted, and generalization is proven on unseen tracks and using legal speed limits. Open-loop tests on real image sequences show some domain adaptation capability of our method.
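
As a rough illustration of the actor-critic idea the abstract refers to, the sketch below computes n-step discounted returns and advantage estimates, the quantities an A3C-style worker typically uses to update its policy and value outputs. It is a generic sketch, not the paper's implementation: the function name, the discount factor, and the toy reward and value numbers are assumptions, and the paper's own reward design is not reproduced here.

```python
from typing import List, Tuple


def n_step_returns_and_advantages(
    rewards: List[float],
    values: List[float],
    bootstrap_value: float,
    gamma: float = 0.99,
) -> Tuple[List[float], List[float]]:
    """Compute discounted n-step returns and advantages for one rollout.

    rewards[t]       reward received after acting at step t (hypothetical values)
    values[t]        critic's estimate V(s_t) at step t
    bootstrap_value  critic's estimate for the state after the last step
                     (use 0.0 if the episode terminated)
    gamma            discount factor (assumed value, not taken from the paper)
    """
    if len(rewards) != len(values):
        raise ValueError("rewards and values must have the same length")

    returns = [0.0] * len(rewards)
    running = bootstrap_value
    # Walk the rollout backwards: R_t = r_t + gamma * R_{t+1}
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running

    # Advantage: how much better the observed return was than the critic expected
    advantages = [ret - val for ret, val in zip(returns, values)]
    return returns, advantages


if __name__ == "__main__":
    # Toy rollout of two steps that ends in a terminal state
    rets, advs = n_step_returns_and_advantages(
        rewards=[1.0, 1.0], values=[0.5, 0.5], bootstrap_value=0.0, gamma=0.5
    )
    print(rets, advs)
```

In an asynchronous setup, each worker would compute these quantities on its own rollout (for example, on its own track) before sending gradients to the shared network; that surrounding machinery is omitted here.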

Taxonomy

Topics: Reinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety · Advanced Neural Network Applications

Methods: SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings