Deep Reinforcement Learning for Autonomous Driving

Sen Wang; Daoyuan Jia; Xinshuo Weng

arXiv:1811.11329·cs.CV·May 21, 2019·44 cites

Deep Reinforcement Learning for Autonomous Driving

Sen Wang, Daoyuan Jia, Xinshuo Weng

PDF

Open Access 1 Repo

TL;DR

This paper applies deep reinforcement learning, specifically DDPG, to autonomous driving in simulation, addressing complex state and action spaces while ensuring safety, and demonstrates promising results in TORCS environment.

Contribution

The paper adapts DDPG for autonomous driving in simulation, designing specific network architectures and reward functions for complex environments.

Findings

01

Effective in TORCS simulation environments

02

Handles continuous state and action spaces

03

Shows promising quantitative and qualitative results

Abstract

Reinforcement learning has steadily improved and outperform human in lots of traditional games since the resurgence of deep neural network. However, these success is not easy to be copied to autonomous driving because the state spaces in real world are extreme complex and action spaces are continuous and fine control is required. Moreover, the autonomous driving vehicles must also keep functional safety under the complex environments. To deal with these challenges, we first adopt the deep deterministic policy gradient (DDPG) algorithm, which has the capacity to handle complex state and action spaces in continuous domain. We then choose The Open Racing Car Simulator (TORCS) as our environment to avoid physical damage. Meanwhile, we select a set of appropriate sensor information from TORCS and design our own rewarder. In order to fit DDPG algorithm to TORCS, we design our network…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

NOHYC/autonomous_driving_car_project
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Adaptive Dynamic Programming Control

MethodsExperience Replay · Dense Connections · Weight Decay · *Communicated@Fast*How Do I Communicate to Expedia? · Adam · Convolution · Batch Normalization · Deep Deterministic Policy Gradient