Research on reinforcement learning based warehouse robot navigation algorithm in complex warehouse layout
Keqin Li, Lipeng Liu, Jiajing Chen, Dezhi Yu, Xiaofan Zhou, Ming Li,, Congyu Wang, Zhao Li

TL;DR
This paper introduces a novel navigation algorithm combining PPO and Dijkstra's algorithm for warehouse robots, significantly improving path accuracy and robustness in complex layouts through efficient learning and global path planning.
Contribution
The paper proposes the PP-D algorithm, integrating PPO and Dijkstra's algorithm, for improved real-time decision making and path optimality in complex warehouse environments.
Findings
PP-D outperforms traditional algorithms in navigation accuracy.
The method reduces collisions and stagnation in complex layouts.
Experimental results confirm the system's robustness and efficiency.
Abstract
In this paper, how to efficiently find the optimal path in complex warehouse layout and make real-time decision is a key problem. This paper proposes a new method of Proximal Policy Optimization (PPO) and Dijkstra's algorithm, Proximal policy-Dijkstra (PP-D). PP-D method realizes efficient strategy learning and real-time decision making through PPO, and uses Dijkstra algorithm to plan the global optimal path, thus ensuring high navigation accuracy and significantly improving the efficiency of path planning. Specifically, PPO enables robots to quickly adapt and optimize action strategies in dynamic environments through its stable policy updating mechanism. Dijkstra's algorithm ensures global optimal path planning in static environment. Finally, through the comparison experiment and analysis of the proposed framework with the traditional algorithm, the results show that the PP-D method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Manufacturing and Logistics Optimization
MethodsEntropy Regularization · Proximal Policy Optimization
