Interpretable End-to-end Urban Autonomous Driving with Latent Deep Reinforcement Learning
Jianyu Chen, Shengbo Eben Li, Masayoshi Tomizuka

TL;DR
This paper introduces an interpretable end-to-end deep reinforcement learning approach for urban autonomous driving, integrating a latent environment model to enhance explainability, reduce sample complexity, and improve performance in complex city scenarios.
Contribution
It presents a novel latent environment model that jointly learns with reinforcement learning, enabling interpretability and better handling of urban driving complexities.
Findings
Outperforms baseline algorithms like DQN, DDPG, TD3, and SAC in urban scenarios.
Generates semantic bird's-eye masks for explaining policy behavior.
Reduces sample complexity of reinforcement learning.
Abstract
Unlike popular modularized framework, end-to-end autonomous driving seeks to solve the perception, decision and control problems in an integrated way, which can be more adapting to new scenarios and easier to generalize at scale. However, existing end-to-end approaches are often lack of interpretability, and can only deal with simple driving tasks like lane keeping. In this paper, we propose an interpretable deep reinforcement learning method for end-to-end autonomous driving, which is able to handle complex urban scenarios. A sequential latent environment model is introduced and learned jointly with the reinforcement learning process. With this latent model, a semantic birdeye mask can be generated, which is enforced to connect with a certain intermediate property in today's modularized framework for the purpose of explaining the behaviors of learned policy. The latent space also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety · Explainable Artificial Intelligence (XAI)
MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator · Weight Decay · Convolution · Batch Normalization · Deep Deterministic Policy Gradient · Q-Learning · Deep Q-Network · Experience Replay
