Learning Robotic Manipulation Tasks via Task Progress based Gaussian   Reward and Loss Adjusted Exploration

Sulabh Kumra; Shirin Josh; Ferat Sahin

arXiv:2103.01434·cs.RO·November 22, 2021

Learning Robotic Manipulation Tasks via Task Progress based Gaussian Reward and Loss Adjusted Exploration

Sulabh Kumra, Shirin Josh, Ferat Sahin

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel deep reinforcement learning approach for robotic manipulation that uses a task progress-based reward and a loss-adjusted exploration policy, achieving state-of-the-art results in complex multi-step tasks.

Contribution

The paper presents RoManNet, a vision-based model architecture, along with TPG reward and LAE policy, to improve learning efficiency and success in multi-step robotic manipulation tasks.

Findings

01

Outperforms existing methods in success rate and efficiency

02

Effective in both simulation and real-world environments

03

Particularly beneficial for multi-block stacking tasks

Abstract

Multi-step manipulation tasks in unstructured environments are extremely challenging for a robot to learn. Such tasks interlace high-level reasoning that consists of the expected states that can be attained to achieve an overall task and low-level reasoning that decides what actions will yield these states. We propose a model-free deep reinforcement learning method to learn multi-step manipulation tasks. We introduce a Robotic Manipulation Network (RoManNet), which is a vision-based model architecture, to learn the action-value functions and predict manipulation action candidates. We define a Task Progress based Gaussian (TPG) reward function that computes the reward based on actions that lead to successful motion primitives and progress towards the overall task goal. To balance the ratio of exploration/exploitation, we introduce a Loss Adjusted Exploration (LAE) policy that determines…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

skumra/romannet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Adversarial Robustness in Machine Learning