Reinforcement Learning for Robotic Manipulation using Simulated   Locomotion Demonstrations

Ozsel Kilinc; Giovanni Montana

arXiv:1910.07294·cs.LG·November 12, 2021

Reinforcement Learning for Robotic Manipulation using Simulated Locomotion Demonstrations

Ozsel Kilinc, Giovanni Montana

PDF

2 Repos

TL;DR

This paper introduces a novel reinforcement learning framework for robotic manipulation that leverages simulated object locomotion demonstrations to improve learning efficiency without human demonstrations.

Contribution

The authors propose using simulated object locomotion policies to generate auxiliary rewards, enabling effective RL for manipulation tasks without human demonstrations.

Findings

01

Achieves higher success rates across 13 tasks

02

Faster learning compared to alternative algorithms

03

Particularly effective for multi-object stacking and non-rigid objects

Abstract

Mastering robotic manipulation skills through reinforcement learning (RL) typically requires the design of shaped reward functions. Recent developments in this area have demonstrated that using sparse rewards, i.e. rewarding the agent only when the task has been successfully completed, can lead to better policies. However, state-action space exploration is more difficult in this case. Recent RL approaches to learning with sparse rewards have leveraged high-quality human demonstrations for the task, but these can be costly, time consuming or even impossible to obtain. In this paper, we propose a novel and effective approach that does not require human demonstrations. We observe that every robotic manipulation task could be seen as involving a locomotion task from the perspective of the object being manipulated, i.e. the object could learn how to reach a target state on its own. In order…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.