Adapting Reinforcement Learning for Path Planning in Constrained Parking Scenarios

Feng Tao; Luca Paparusso; Chenyi Gu; Robin Koehler; Chenxu Wu; Xinyu Huang; Christian Juette; David Paz; Ren Liu

arXiv:2601.22545·cs.RO·February 2, 2026

Adapting Reinforcement Learning for Path Planning in Constrained Parking Scenarios

Feng Tao, Luca Paparusso, Chenyi Gu, Robin Koehler, Chenxu Wu, Xinyu Huang, Christian Juette, David Paz, Ren Liu

PDF

Open Access 3 Reviews

TL;DR

This paper presents a deep reinforcement learning framework for real-time path planning in constrained parking scenarios, outperforming classical methods in success rate and efficiency without relying on ideal perception or complex modules.

Contribution

Introduces a DRL-based path planning approach that handles complex parking environments, with a new benchmark for training and evaluation, enabling practical real-time autonomous navigation.

Findings

01

Achieves +96% success rate over classical planners.

02

Improves efficiency by +52% in path planning tasks.

03

Provides an open-source benchmark for future research.

Abstract

Real-time path planning in constrained environments remains a fundamental challenge for autonomous systems. Traditional classical planners, while effective under perfect perception assumptions, are often sensitive to real-world perception constraints and rely on online search procedures that incur high computational costs. In complex surroundings, this renders real-time deployment prohibitive. To overcome these limitations, we introduce a Deep Reinforcement Learning (DRL) framework for real-time path planning in parking scenarios. In particular, we focus on challenging scenes with tight spaces that require a high number of reversal maneuvers and adjustments. Unlike classical planners, our solution does not require ideal and structured perception, and in principle, could avoid the need for additional modules such as localization and tracking, resulting in a simpler and more practical…

Peer Reviews

Decision·Submitted to ICLR 2026

Reviewer 01Rating 6Confidence 3

Strengths

The performance compared to the baseline is quite impressive. The paper also contributes a benchmark. Including a bicycle model introduces an easily tunable component, if more complex vehicle models are needed, and that should ensure feasible trajectories. The benchmark provides very simple lidar scans which may seem like a disadvantage but is an easily transferable representation that should be easy to provide in many contexts. It is also very easy to simulate as the paper shows. The appro

Weaknesses

While having impressive results, they are on a novel self-created benchmark which did not give other authors opportunity to optimize on it. Therefore, results have to be taken with a grain of salt and if the benchmark is accepted in the domain time must tell how these results hold up. It seems more direction changes are needed, compared to a simple A* algorithm. While other metrics indicate better performance it would be interesting how this scales, i.e. how many more pivot points, which should

Reviewer 02Rating 2Confidence 4

Strengths

* Strong empirical results: Achieves 92.2% success on ParkBench vs. 47.1% for Hybrid A*, and 2× improvement in time efficiency. * Benchmark contribution: ParkBench fills a benchmark gap in parking evaluation, providing 51 realistic layouts for reproducibility and comparison.

Weaknesses

* The RL system presented in this paper is fairly straight forward. The component includes a handcrafted curriculum for initial configuration and motion primitive (action chunking). These components are well-established in the literature and the author does not demonstrate sufficient effort in integrating these components as a whole system. * There exists a lot of RL-based motion planning for mobile robot, many of them are trained in high-fidelity simulator, such as Gazebo and IsaacLab. The au

Reviewer 03Rating 2Confidence 5

Strengths

1. The paper shows the effectiveness of using the PPO algorithm as a planner in parking scenarios. 2. It introduces a new benchmark, ParkBench, to facilitate research on path planning in parking environments.

Weaknesses

1. In the introduction, the authors claim that RL-based methods, as representatives of closed-loop approaches, remain underexplored in path planning. However, RL-based planners have been extensively studied in various path planning tasks that consider practical constraints across diverse real-world applications, including transportation, warehousing, and surgical robotics. 2. Numerous studies have focused on developing RL-based planners in related domains. Although this paper centers on parking

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Path Planning Algorithms · Autonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics