Deep Reinforcement Learning with a Stage Incentive Mechanism of Dense Reward for Robotic Trajectory Planning

Gang Peng; Jin Yang; Xinde Li; Mohammad Omar Khyam

arXiv:2009.12068 · cs.AI · May 25, 2021

TL;DR

This paper introduces dense reward functions and a stage incentive mechanism to enhance deep reinforcement learning efficiency in robotic trajectory planning, achieving faster convergence and higher success rates in manipulator tasks.

Contribution

The paper proposes dense reward functions and a stage incentive mechanism inspired by human cognition, which improve learning speed and stability in DRL-based robot trajectory planning.

Findings

1. Soft stage incentive reward improves convergence rate by up to 46.9%.

2. Success rate of trajectory planning reaches 99.6%.

3. Reductions in standard deviation of rewards indicate more stable learning.
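
The stability finding can be illustrated with a small sketch. This summary does not say how the paper computes the standard deviation of rewards, so the sliding window, the function name `rolling_std`, and the sample returns below are all assumptions made for illustration only.

```python
from statistics import pstdev


def rolling_std(returns, window):
    """Population standard deviation of episode returns over a sliding window.

    Hypothetical helper: the window-based view is an assumption, not the
    paper's stated metric. A lower value means the returns fluctuate less.
    """
    if window < 1 or window > len(returns):
        raise ValueError("window must be between 1 and len(returns)")
    return [
        pstdev(returns[i - window:i])
        for i in range(window, len(returns) + 1)
    ]


# Made-up returns for two training runs, purely illustrative.
noisy_run = [0.0, 2.0, 0.0, 2.0]
steady_run = [1.0, 1.0, 1.0, 1.0]

noisy_spread = rolling_std(noisy_run, window=2)
steady_spread = rolling_std(steady_run, window=2)
```

Under this reading, a run whose rolling standard deviation shrinks over training would count as the more stable one.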

Abstract

(This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible.) To improve the efficiency of deep reinforcement learning (DRL)-based methods for robot manipulator trajectory planning in random working environments, we present three dense reward functions. These rewards differ from the traditional sparse reward. First, a posture reward function is proposed to speed up the learning process with a more reasonable trajectory by modeling the distance and direction constraints, which can reduce the blindness of exploration. Second, a stride reward function is proposed to improve the stability of the learning process by modeling the distance and movement distance of joint constraints. Finally, in order to further improve learning efficiency, we are inspired by the cognitive process of…
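
To make the contrast between sparse and dense rewards concrete, here is a minimal sketch. It is not the paper's formulation: the abstract only states that the posture reward models distance and direction constraints and that the stride reward models distance and joint movement. The function names, weights (`w_dist`, `w_dir`, `w_stride`), and exact expressions are hypothetical.

```python
import math


def _sub(a, b):
    return [x - y for x, y in zip(a, b)]


def _norm(v):
    return math.sqrt(sum(x * x for x in v))


def sparse_reward(ee_pos, target_pos, tol=0.01):
    """Traditional sparse reward: non-zero only when the target is reached."""
    return 1.0 if _norm(_sub(target_pos, ee_pos)) <= tol else 0.0


def posture_reward(prev_ee_pos, ee_pos, target_pos, w_dist=1.0, w_dir=0.5):
    """Assumed dense reward combining a distance term and a direction term.

    Distance term: negative remaining distance to the target.
    Direction term: cosine between the step taken and the direction to the
    target, so steps toward the target score higher than steps away.
    """
    to_target = _sub(target_pos, prev_ee_pos)
    move = _sub(ee_pos, prev_ee_pos)
    dist = _norm(_sub(target_pos, ee_pos))
    denom = _norm(to_target) * _norm(move)
    if denom == 0.0:
        cos_angle = 0.0  # no movement, or already at the target
    else:
        cos_angle = sum(a * b for a, b in zip(to_target, move)) / denom
    return -w_dist * dist + w_dir * cos_angle


def stride_reward(joint_deltas, ee_pos, target_pos, w_stride=1.0):
    """Assumed dense penalty on joint movement, weighted by target distance.

    Large joint steps are penalized more strongly when the end effector is
    close to the target, which discourages overshooting.
    """
    dist = _norm(_sub(target_pos, ee_pos))
    return -w_stride * _norm(joint_deltas) / (dist + 1.0)
```

Unlike `sparse_reward`, both dense terms return an informative value at every step, which is the property the abstract credits with reducing blind exploration.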
