Boosting Offline Reinforcement Learning for Autonomous Driving with   Hierarchical Latent Skills

Zenan Li; Fan Nie; Qiao Sun; Fang Da; Hang Zhao

arXiv:2309.13614·cs.RO·November 20, 2023

Boosting Offline Reinforcement Learning for Autonomous Driving with Hierarchical Latent Skills

Zenan Li, Fan Nie, Qiao Sun, Fang Da, Hang Zhao

PDF

Open Access

TL;DR

This paper introduces a hierarchical skill-based framework utilizing a variational autoencoder to improve offline reinforcement learning for long-horizon autonomous vehicle planning, demonstrating superior performance and interpretability in simulation.

Contribution

The work develops a novel skill learning approach with a two-branch VAE to enhance offline RL for extended vehicle planning tasks, addressing long-horizon challenges.

Findings

01

Outperforms strong baselines in CARLA simulations

02

Enables long-term reasoning with learned skills

03

Shows interpretability and transferability of skills

Abstract

Learning-based vehicle planning is receiving increasing attention with the emergence of diverse driving simulators and large-scale driving datasets. While offline reinforcement learning (RL) is well suited for these safety-critical tasks, it still struggles to plan over extended periods. In this work, we present a skill-based framework that enhances offline RL to overcome the long-horizon vehicle planning challenge. Specifically, we design a variational autoencoder (VAE) to learn skills from offline demonstrations. To mitigate posterior collapse of common VAEs, we introduce a two-branch sequence encoder to capture both discrete options and continuous variations of the complex driving skills. The final policy treats learned skills as actions and can be trained by any off-the-shelf offline RL algorithms. This facilitates a shift in focus from per-step actions to temporally extended…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics · Energy, Environment, and Transportation Policies

MethodsEntropy Regularization · Proximal Policy Optimization · Focus · CARLA: An Open Urban Driving Simulator