Boosting Offline Reinforcement Learning for Autonomous Driving with Hierarchical Latent Skills
Zenan Li, Fan Nie, Qiao Sun, Fang Da, Hang Zhao

TL;DR
This paper introduces a hierarchical skill-based framework utilizing a variational autoencoder to improve offline reinforcement learning for long-horizon autonomous vehicle planning, demonstrating superior performance and interpretability in simulation.
Contribution
The work develops a novel skill learning approach with a two-branch VAE to enhance offline RL for extended vehicle planning tasks, addressing long-horizon challenges.
Findings
Outperforms strong baselines in CARLA simulations
Enables long-term reasoning with learned skills
Shows interpretability and transferability of skills
Abstract
Learning-based vehicle planning is receiving increasing attention with the emergence of diverse driving simulators and large-scale driving datasets. While offline reinforcement learning (RL) is well suited for these safety-critical tasks, it still struggles to plan over extended periods. In this work, we present a skill-based framework that enhances offline RL to overcome the long-horizon vehicle planning challenge. Specifically, we design a variational autoencoder (VAE) to learn skills from offline demonstrations. To mitigate posterior collapse of common VAEs, we introduce a two-branch sequence encoder to capture both discrete options and continuous variations of the complex driving skills. The final policy treats learned skills as actions and can be trained by any off-the-shelf offline RL algorithms. This facilitates a shift in focus from per-step actions to temporally extended…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics · Energy, Environment, and Transportation Policies
MethodsEntropy Regularization · Proximal Policy Optimization · Focus · CARLA: An Open Urban Driving Simulator
