Action and Trajectory Planning for Urban Autonomous Driving with Hierarchical Reinforcement Learning
Xinyang Lu, Flint Xiaofeng Fan, Tianying Wang

TL;DR
This paper introduces a hierarchical reinforcement learning approach for urban autonomous driving that effectively handles complex multi-task scenarios with dynamic environments, improving decision-making and trajectory planning.
Contribution
The paper proposes atHRL, a hierarchical RL method that models urban driving tasks and plans trajectories using perception data, outperforming existing RL approaches.
Findings
Significant performance improvements over state-of-the-art RL methods.
Effective handling of multiple driving tasks in complex urban scenarios.
Robust trajectory planning in dynamic environments with other vehicles.
Abstract
Reinforcement Learning (RL) has made promising progress in planning and decision-making for Autonomous Vehicles (AVs) in simple driving scenarios. However, existing RL algorithms for AVs fail to learn critical driving skills in complex urban scenarios. First, urban driving scenarios require AVs to handle multiple driving tasks of which conventional RL algorithms are incapable. Second, the presence of other vehicles in urban scenarios results in a dynamically changing environment, which challenges RL algorithms to plan the action and trajectory of the AV. In this work, we propose an action and trajectory planner using Hierarchical Reinforcement Learning (atHRL) method, which models the agent behavior in a hierarchical model by using the perception of the lidar and birdeye view. The proposed atHRL method learns to make decisions about the agent's future trajectory and computes target…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Traffic control and management · Reinforcement Learning in Robotics
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Entropy Regularization · Proximal Policy Optimization · fail · Convolution · Batch Normalization · Dense Connections · Experience Replay · Adam · Weight Decay
