Reinforcement Learning for Robot Navigation with Adaptive Forward Simulation Time (AFST) in a Semi-Markov Model

Yu'an Chen; Ruosong Ye; Ziyang Tao; Hongjian Liu; Guangda Chen; Jie Peng; Jun Ma; Yu Zhang; Jianmin Ji; Yanyong Zhang

arXiv:2108.06161·cs.RO·July 6, 2023

TL;DR

This paper introduces Adaptive Forward Simulation Time (AFST), a deep reinforcement learning (DRL) method for robot navigation. It models navigation as a semi-Markov decision process (SMDP) and adapts the forward simulation time to handle local minima in complex unknown environments.

Contribution

The paper presents the first DRL navigation approach modeled by a semi-Markov decision process with continuous actions, incorporating adaptive simulation time to improve navigation in unknown environments.
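To make the semi-Markov idea concrete, here is a minimal sketch of a decision step whose duration is part of the action. It is an illustration only, not the authors' implementation: the unicycle motion model, the function name `simulate_forward`, and the parameters `v`, `w`, `duration`, and `dt` are all assumptions introduced here.

```python
import math


def simulate_forward(x, y, theta, v, w, duration, dt=0.05):
    """Roll a unicycle model forward for `duration` seconds.

    Hypothetical illustration: one SMDP decision holds the velocity command
    (v, w) fixed for a policy-chosen duration instead of a fixed control period.
    """
    # Split the chosen duration into roughly dt-sized Euler steps.
    steps = max(1, round(duration / dt))
    h = duration / steps
    for _ in range(steps):
        x += v * math.cos(theta) * h
        y += v * math.sin(theta) * h
        theta += w * h
    return x, y, theta


# One decision that drives straight for 2 s, then one that turns for 0.5 s.
pose = simulate_forward(0.0, 0.0, 0.0, v=1.0, w=0.0, duration=2.0)
pose = simulate_forward(*pose, v=0.5, w=1.0, duration=0.5)
```

Because each decision can last a different amount of time, consecutive transitions are not equally spaced, which is what distinguishes this setting from a fixed-step Markov decision process.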

Findings

1. AFST outperforms existing methods in unknown environments.
2. Modified GAE enhances policy gradient estimation in SMDPs.
3. Experimental results validate the effectiveness of AFST.

Abstract

Deep reinforcement learning (DRL) algorithms have proven effective in robot navigation, especially in unknown environments, by directly mapping perception inputs into robot control commands. However, most existing methods ignore the local minimum problem in navigation and thereby cannot handle complex unknown environments. In this paper, we propose the first DRL-based navigation method modeled by a semi-Markov decision process (SMDP) with continuous action space, named Adaptive Forward Simulation Time (AFST), to overcome this problem. Specifically, we reduce the dimensions of the action space and improve the distributed proximal policy optimization (DPPO) algorithm for the specified SMDP problem by modifying its GAE to better estimate the policy gradient in SMDPs. Experiments in various unknown environments demonstrate the effectiveness of AFST.
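The abstract does not state the modified GAE formula, so the following is only a sketch of one common way to adapt generalized advantage estimation to variable-duration steps: the discount for each transition is raised to the power of that transition's duration. The function name `smdp_gae`, its parameters, and the choice of exponentiating both `gamma` and `gamma * lam` are assumptions, not the paper's definition.

```python
def smdp_gae(rewards, values, durations, last_value, gamma=0.99, lam=0.95):
    """Advantage estimates for transitions with variable durations.

    Assumed formulation (not taken from the paper):
        delta_t = r_t + gamma**tau_t * V(s_{t+1}) - V(s_t)
        A_t     = delta_t + (gamma * lam)**tau_t * A_{t+1}
    where tau_t is the duration of step t. With every tau_t == 1 this
    reduces to standard GAE.
    """
    advantages = [0.0] * len(rewards)
    next_value = last_value  # bootstrap value for the state after the last step
    next_adv = 0.0
    for t in reversed(range(len(rewards))):
        tau = durations[t]
        delta = rewards[t] + gamma ** tau * next_value - values[t]
        next_adv = delta + (gamma * lam) ** tau * next_adv
        advantages[t] = next_adv
        next_value = values[t]
    return advantages


# Two steps: the first lasts 2 time units, the second lasts 1.
adv = smdp_gae(rewards=[1.0, 1.0], values=[0.0, 0.0], durations=[2, 1],
               last_value=0.0, gamma=0.5, lam=1.0)
```

Under this assumed formulation, longer steps discount the future more heavily, so a reward reached after a long forward simulation counts for less than the same reward reached quickly.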

Code & Models

Repositories

yohannnchen/afst (Official)

Taxonomy

Topics: Reinforcement Learning in Robotics · Robotic Path Planning Algorithms · Modular Robots and Swarm Intelligence