Action and Trajectory Planning for Urban Autonomous Driving with   Hierarchical Reinforcement Learning

Xinyang Lu; Flint Xiaofeng Fan; Tianying Wang

arXiv:2306.15968·cs.RO·June 29, 2023·2 cites

Action and Trajectory Planning for Urban Autonomous Driving with Hierarchical Reinforcement Learning

Xinyang Lu, Flint Xiaofeng Fan, Tianying Wang

PDF

Open Access

TL;DR

This paper introduces a hierarchical reinforcement learning approach for urban autonomous driving that effectively handles complex multi-task scenarios with dynamic environments, improving decision-making and trajectory planning.

Contribution

The paper proposes atHRL, a hierarchical RL method that models urban driving tasks and plans trajectories using perception data, outperforming existing RL approaches.

Findings

01

Significant performance improvements over state-of-the-art RL methods.

02

Effective handling of multiple driving tasks in complex urban scenarios.

03

Robust trajectory planning in dynamic environments with other vehicles.

Abstract

Reinforcement Learning (RL) has made promising progress in planning and decision-making for Autonomous Vehicles (AVs) in simple driving scenarios. However, existing RL algorithms for AVs fail to learn critical driving skills in complex urban scenarios. First, urban driving scenarios require AVs to handle multiple driving tasks of which conventional RL algorithms are incapable. Second, the presence of other vehicles in urban scenarios results in a dynamically changing environment, which challenges RL algorithms to plan the action and trajectory of the AV. In this work, we propose an action and trajectory planner using Hierarchical Reinforcement Learning (atHRL) method, which models the agent behavior in a hierarchical model by using the perception of the lidar and birdeye view. The proposed atHRL method learns to make decisions about the agent's future trajectory and computes target…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutonomous Vehicle Technology and Safety · Traffic control and management · Reinforcement Learning in Robotics

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Entropy Regularization · Proximal Policy Optimization · fail · Convolution · Batch Normalization · Dense Connections · Experience Replay · Adam · Weight Decay