Learning of Long-Horizon Sparse-Reward Robotic Manipulator Tasks with   Base Controllers

Guangming Wang; Minjian Xin; Wenhua Wu; Zhe Liu; Hesheng Wang

arXiv:2011.12105·cs.RO·December 7, 2021·1 cites

Learning of Long-Horizon Sparse-Reward Robotic Manipulator Tasks with Base Controllers

Guangming Wang, Minjian Xin, Wenhua Wu, Zhe Liu, Hesheng Wang

PDF

Open Access

TL;DR

This paper introduces a reinforcement learning method that integrates traditional base controllers to efficiently learn long-horizon, sparse-reward robotic manipulation tasks, significantly improving performance and sample efficiency.

Contribution

It proposes a novel approach combining base controllers with deep RL to enhance exploration and learning in complex robotic tasks, including synthesizing multiple controllers.

Findings

01

Outperforms base controllers in stacking and cup tasks

02

Achieves orders of magnitude better sample efficiency than learning from demonstrations

03

Demonstrates potential for industrial robot manipulation systems

Abstract

Deep Reinforcement Learning (DRL) enables robots to perform some intelligent tasks end-to-end. However, there are still many challenges for long-horizon sparse-reward robotic manipulator tasks. On the one hand, a sparse-reward setting causes exploration inefficient. On the other hand, exploration using physical robots is of high cost and unsafe. In this paper, we propose a method of learning long-horizon sparse-reward tasks utilizing one or more existing traditional controllers named base controllers in this paper. Built upon Deep Deterministic Policy Gradients (DDPG), our algorithm incorporates the existing base controllers into stages of exploration, value learning, and policy update. Furthermore, we present a straightforward way of synthesizing different base controllers to integrate their strengths. Through experiments ranging from stacking blocks to cups, it is demonstrated that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeurological disorders and treatments · Reinforcement Learning in Robotics · Neuroscience and Neural Engineering