Research on reinforcement learning based warehouse robot navigation   algorithm in complex warehouse layout

Keqin Li; Lipeng Liu; Jiajing Chen; Dezhi Yu; Xiaofan Zhou; Ming Li,; Congyu Wang; Zhao Li

arXiv:2411.06128·cs.RO·November 12, 2024·6 cites

Research on reinforcement learning based warehouse robot navigation algorithm in complex warehouse layout

Keqin Li, Lipeng Liu, Jiajing Chen, Dezhi Yu, Xiaofan Zhou, Ming Li,, Congyu Wang, Zhao Li

PDF

Open Access

TL;DR

This paper introduces a novel navigation algorithm combining PPO and Dijkstra's algorithm for warehouse robots, significantly improving path accuracy and robustness in complex layouts through efficient learning and global path planning.

Contribution

The paper proposes the PP-D algorithm, integrating PPO and Dijkstra's algorithm, for improved real-time decision making and path optimality in complex warehouse environments.

Findings

01

PP-D outperforms traditional algorithms in navigation accuracy.

02

The method reduces collisions and stagnation in complex layouts.

03

Experimental results confirm the system's robustness and efficiency.

Abstract

In this paper, how to efficiently find the optimal path in complex warehouse layout and make real-time decision is a key problem. This paper proposes a new method of Proximal Policy Optimization (PPO) and Dijkstra's algorithm, Proximal policy-Dijkstra (PP-D). PP-D method realizes efficient strategy learning and real-time decision making through PPO, and uses Dijkstra algorithm to plan the global optimal path, thus ensuring high navigation accuracy and significantly improving the efficiency of path planning. Specifically, PPO enables robots to quickly adapt and optimize action strategies in dynamic environments through its stable policy updating mechanism. Dijkstra's algorithm ensures global optimal path planning in static environment. Finally, through the comparison experiment and analysis of the proposed framework with the traditional algorithm, the results show that the PP-D method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Manufacturing and Logistics Optimization

MethodsEntropy Regularization · Proximal Policy Optimization