Reinforced Imitative Trajectory Planning for Urban Automated Driving
Di Zeng, Ling Zheng, Xiantong Yang, Yinong Li

TL;DR
This paper introduces a novel RL-based trajectory planning approach for urban automated driving that combines imitation learning, multi-step planning, and a transformer-based Bayesian reward function to improve safety, interpretability, and performance.
Contribution
It proposes a multi-step RL planning method integrated with imitation learning and a transformer-based Bayesian reward function tailored for urban driving scenarios.
Findings
Significantly outperforms baseline methods in urban driving tasks.
Achieves competitive results with state-of-the-art approaches.
Validated on large-scale real-world urban dataset.
Abstract
Reinforcement learning (RL) faces challenges in trajectory planning for urban automated driving due to the poor convergence of RL and the difficulty in designing reward functions. Consequently, few RL-based trajectory planning methods can achieve performance comparable to that of imitation learning-based methods. The convergence problem is alleviated by combining RL with supervised learning. However, most existing approaches only reason one step ahead and lack the capability to plan for multiple future steps. Besides, although inverse reinforcement learning holds promise for solving the reward function design issue, existing methods for automated driving impose a linear structure assumption on reward functions, making them difficult to apply to urban automated driving. In light of these challenges, this paper proposes a novel RL-based trajectory planning method that integrates RL with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTransportation and Mobility Innovations · Autonomous Vehicle Technology and Safety · Robotic Path Planning Algorithms
