TL;DR
This paper introduces a hierarchical reinforcement learning framework combined with a PID controller for autonomous vehicle trajectory planning, improving safety, robustness, and efficiency in dynamic environments.
Contribution
The paper presents a novel HRL-based trajectory planning method with sub-goals and LSTM integration, enhancing generalization, safety, and computational efficiency over existing approaches.
Findings
Reduces convergence time in trajectory planning.
Generates smoother and safer trajectories.
Handles noisy perception and dynamic environments effectively.
Abstract
Planning safe trajectories under uncertain and dynamic conditions makes the autonomous driving problem significantly complex. Current sampling-based methods such as Rapidly Exploring Random Trees (RRTs) are not ideal for this problem because of the high computational cost. Supervised learning methods such as Imitation Learning lack generalization and safety guarantees. To address these problems and in order to ensure a robust framework, we propose a Hierarchical Reinforcement Learning (HRL) structure combined with a Proportional-Integral-Derivative (PID) controller for trajectory planning. HRL helps divide the task of autonomous vehicle driving into sub-goals and supports the network to learn policies for both high-level options and low-level trajectory planner choices. The introduction of sub-goals decreases convergence time and enables the policies learned to be reused for other…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator
