Avoidance Navigation Based on Offline Pre-Training Reinforcement   Learning

Yang Wenkai Ji Ruihang Zhang Yuxiang Lei Hao; Zhao Zijie

arXiv:2308.01551·cs.RO·August 4, 2023·1 cites

Avoidance Navigation Based on Offline Pre-Training Reinforcement Learning

Yang Wenkai Ji Ruihang Zhang Yuxiang Lei Hao, Zhao Zijie

PDF

Open Access

TL;DR

This paper introduces an offline pre-training deep reinforcement learning approach for mobile robot avoidance navigation that significantly reduces training time and demonstrates strong generalization in various environments.

Contribution

The study proposes a novel offline pre-training strategy with expert experience collection, improving training efficiency and generalization for robot avoidance navigation without maps.

Findings

01

Reduced training time by 80%

02

Doubled the reward compared to baseline DRL

03

Achieved collision-free navigation in multiple environments

Abstract

This paper presents a Pre-Training Deep Reinforcement Learning(DRL) for avoidance navigation without map for mobile robots which map raw sensor data to control variable and navigate in an unknown environment. The efficient offline training strategy is proposed to speed up the inefficient random explorations in early stage and we also collect a universal dataset including expert experience for offline training, which is of some significance for other navigation training work. The pre-training and prioritized expert experience are proposed to reduce 80\% training time and has been verified to improve the 2 times reward of DRL. The advanced simulation gazebo with real physical modelling and dynamic equations reduce the gap between sim-to-real. We train our model a corridor environment, and evaluate the model in different environment getting the same effect. Compared to traditional method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Path Planning Algorithms · Reinforcement Learning in Robotics · Robotics and Automated Systems

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings