Reinforcement Learning for Pivoting Task

Rika Antonova; Silvia Cruciani; Christian Smith; Danica Kragic

arXiv:1703.00472·cs.RO·March 3, 2017·36 cites

Reinforcement Learning for Pivoting Task

Rika Antonova, Silvia Cruciani, Christian Smith, Danica Kragic

PDF

Open Access 1 Repo

TL;DR

This paper presents a reinforcement learning approach that trains policies in a simple simulator to perform pivoting tasks on real robots, demonstrating robustness and generalization despite simulation-reality mismatch.

Contribution

It introduces a training procedure enabling the use of simple simulators to learn robust policies transferable to real-world pivoting tasks.

Findings

01

Policy successfully pivots objects to target angles on real robot

02

Demonstrates generalization to objects with different properties

03

Reduces training episodes needed compared to previous methods

Abstract

In this work we propose an approach to learn a robust policy for solving the pivoting task. Recently, several model-free continuous control algorithms were shown to learn successful policies without prior knowledge of the dynamics of the task. However, obtaining successful policies required thousands to millions of training episodes, limiting the applicability of these approaches to real hardware. We developed a training procedure that allows us to use a simple custom simulator to learn policies robust to the mismatch of simulation vs robot. In our experiments, we demonstrate that the policy learned in the simulator is able to pivot the object to the desired target angle on the real robot. We also show generalization to an object with different inertia, shape, mass and friction properties than those used during training. This result is a step towards making model-free reinforcement…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

LeoToledo/PivotingTaskRL
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Control Systems Optimization · Adaptive Dynamic Programming Control