Interpretable End-to-end Urban Autonomous Driving with Latent Deep Reinforcement Learning

Jianyu Chen; Shengbo Eben Li; Masayoshi Tomizuka

arXiv:2001.08726 · cs.RO · July 8, 2020 · 24 cites

TL;DR

This paper introduces an interpretable end-to-end deep reinforcement learning approach for urban autonomous driving, integrating a latent environment model to enhance explainability, reduce sample complexity, and improve performance in complex city scenarios.

Contribution

It presents a novel latent environment model that is learned jointly with the reinforcement learning process, enabling interpretability and better handling of urban driving complexities.

Findings

01. Outperforms baseline algorithms such as DQN, DDPG, TD3, and SAC in urban scenarios.

02. Generates semantic bird's-eye masks for explaining policy behavior.

03. Reduces the sample complexity of reinforcement learning.

Abstract

Unlike the popular modularized framework, end-to-end autonomous driving seeks to solve the perception, decision, and control problems in an integrated way, which can adapt better to new scenarios and generalize more easily at scale. However, existing end-to-end approaches often lack interpretability and can only deal with simple driving tasks such as lane keeping. In this paper, we propose an interpretable deep reinforcement learning method for end-to-end autonomous driving, which is able to handle complex urban scenarios. A sequential latent environment model is introduced and learned jointly with the reinforcement learning process. With this latent model, a semantic bird's-eye mask can be generated, which is enforced to connect with certain intermediate properties in today's modularized framework in order to explain the behaviors of the learned policy. The latent space also…
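The data flow described in the abstract can be pictured with a small toy sketch: one latent state is updated at each step from the previous latent, the previous action, and the new observation, and that same latent feeds both a policy and a decoder that emits a bird's-eye mask. Everything below is an illustrative assumption rather than the authors' implementation: the class name `ToyLatentDrivingAgent`, the linear layers, the dimensions, and the deterministic update are all hypothetical, the weights are random and untrained, and no reinforcement learning update is shown.

```python
import math
import random


def matvec(weights, vector):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(w * x for w, x in zip(row, vector)) for row in weights]


def rand_matrix(rows, cols, rng, scale=0.1):
    """Small random matrix; stands in for learned parameters."""
    return [[rng.gauss(0.0, scale) for _ in range(cols)] for _ in range(rows)]


class ToyLatentDrivingAgent:
    """Hypothetical toy model: NOT the architecture used in the paper."""

    def __init__(self, obs_dim, latent_dim, mask_dim, action_dim, seed=0):
        rng = random.Random(seed)
        self.latent_dim = latent_dim
        self.action_dim = action_dim
        # Sequential latent update: z_t depends on z_{t-1}, a_{t-1}, o_t.
        self.w_rec = rand_matrix(latent_dim, latent_dim, rng)
        self.w_act = rand_matrix(latent_dim, action_dim, rng)
        self.w_obs = rand_matrix(latent_dim, obs_dim, rng)
        # Two heads share the same latent: a mask decoder and a policy.
        self.w_mask = rand_matrix(mask_dim, latent_dim, rng)
        self.w_pi = rand_matrix(action_dim, latent_dim, rng)

    def initial_latent(self):
        return [0.0] * self.latent_dim

    def update_latent(self, z_prev, a_prev, obs):
        parts = zip(matvec(self.w_rec, z_prev),
                    matvec(self.w_act, a_prev),
                    matvec(self.w_obs, obs))
        return [math.tanh(r + a + o) for r, a, o in parts]

    def decode_mask(self, z):
        # Per-cell values in (0, 1); a stand-in for a semantic mask.
        return [1.0 / (1.0 + math.exp(-v)) for v in matvec(self.w_mask, z)]

    def act(self, z):
        # Bounded continuous action, e.g. (throttle, steer) in [-1, 1].
        return [math.tanh(v) for v in matvec(self.w_pi, z)]


def rollout(agent, observations):
    """Run the agent over a sequence of flat observation vectors."""
    z = agent.initial_latent()
    action = [0.0] * agent.action_dim
    trace = []
    for obs in observations:
        z = agent.update_latent(z, action, obs)
        action = agent.act(z)
        trace.append((action, agent.decode_mask(z)))
    return trace


if __name__ == "__main__":
    agent = ToyLatentDrivingAgent(obs_dim=4, latent_dim=3, mask_dim=6, action_dim=2)
    for action, mask in rollout(agent, [[0.1, 0.2, 0.3, 0.4], [0.0, 0.1, 0.0, 0.1]]):
        print(action, mask)
```

The point of the shared latent is that the decoded mask and the chosen action come from the same internal state, which is what lets the mask serve as an explanation of the policy's behavior. In a trained system the mask head would be supervised and the policy optimized by an RL algorithm; neither step is included in this sketch.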

Taxonomy

Topics: Reinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety · Explainable Artificial Intelligence (XAI)

Methods: Entropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator · Weight Decay · Convolution · Batch Normalization · Deep Deterministic Policy Gradient · Q-Learning · Deep Q-Network · Experience Replay