Real-Robot Deep Reinforcement Learning: Improving Trajectory Tracking of   Flexible-Joint Manipulator with Reference Correction

Dmytro Pavlichenko; Sven Behnke

arXiv:2203.07051·cs.RO·March 15, 2022

Real-Robot Deep Reinforcement Learning: Improving Trajectory Tracking of Flexible-Joint Manipulator with Reference Correction

Dmytro Pavlichenko, Sven Behnke

PDF

Open Access

TL;DR

This paper presents a deep reinforcement learning approach for real-robot trajectory tracking of flexible-joint manipulators, achieving rapid learning and improved accuracy over traditional controllers.

Contribution

The work introduces a model-free, off-policy RL method with reference correction and informed initialization for efficient real-robot learning.

Findings

01

Learned policy in under two hours on real robot

02

Significantly improved trajectory tracking accuracy

03

Generalizes to unseen payloads

Abstract

Flexible-joint manipulators are governed by complex nonlinear dynamics, defining a challenging control problem. In this work, we propose an approach to learn an outer-loop joint trajectory tracking controller with deep reinforcement learning. The controller represented by a stochastic policy is learned in under two hours directly on the real robot. This is achieved through bounded reference correction actions and use of a model-free off-policy learning method. In addition, an informed policy initialization is proposed, where the agent is pre-trained in a learned simulation. We test our approach on the 7 DOF manipulator of a Baxter robot. We demonstrate that the proposed method is capable of consistent learning across multiple runs when applied directly on the real robot. Our method yields a policy which significantly improves the trajectory tracking accuracy in comparison to the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Prosthetics and Rehabilitation Robotics · Mechanical Circulatory Support Devices