Using Implicit Behavior Cloning and Dynamic Movement Primitive to   Facilitate Reinforcement Learning for Robot Motion Planning

Zengjie Zhang; Jayden Hong; Amir Soufi Enayati; and Homayoun Najjaran

arXiv:2307.16062·cs.RO·August 20, 2024

Using Implicit Behavior Cloning and Dynamic Movement Primitive to Facilitate Reinforcement Learning for Robot Motion Planning

Zengjie Zhang, Jayden Hong, Amir Soufi Enayati, and Homayoun Najjaran

PDF

Open Access

TL;DR

This paper introduces a new RL framework for robot motion planning that combines implicit behavior cloning and dynamic movement primitives to enhance training efficiency and generalization, validated through simulation and real-robot experiments.

Contribution

The paper presents a novel integration of IBC and DMP with RL for improved robot motion planning, including a new human demonstration dataset and validation in simulation and real-world tasks.

Findings

01

Faster training speed compared to conventional RL methods

02

Higher performance scores in simulation tasks

03

Successful application to a simple assembly task in real robot experiments

Abstract

Reinforcement learning (RL) for motion planning of multi-degree-of-freedom robots still suffers from low efficiency in terms of slow training speed and poor generalizability. In this paper, we propose a novel RL-based robot motion planning framework that uses implicit behavior cloning (IBC) and dynamic movement primitive (DMP) to improve the training speed and generalizability of an off-policy RL agent. IBC utilizes human demonstration data to leverage the training speed of RL, and DMP serves as a heuristic model that transfers motion planning into a simpler planning space. To support this, we also create a human demonstration dataset using a pick-and-place experiment that can be used for similar studies. Comparison studies in simulation reveal the advantage of the proposed method over the conventional RL agents with faster training speed and higher scores. A real-robot experiment…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Evolutionary Algorithms and Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings