Reinforcement Learning for Pivoting Task
Rika Antonova, Silvia Cruciani, Christian Smith, Danica Kragic

TL;DR
This paper presents a reinforcement learning approach that trains policies in a simple simulator to perform pivoting tasks on real robots, demonstrating robustness and generalization despite simulation-reality mismatch.
Contribution
It introduces a training procedure enabling the use of simple simulators to learn robust policies transferable to real-world pivoting tasks.
Findings
Policy successfully pivots objects to target angles on real robot
Demonstrates generalization to objects with different properties
Reduces training episodes needed compared to previous methods
Abstract
In this work we propose an approach to learn a robust policy for solving the pivoting task. Recently, several model-free continuous control algorithms were shown to learn successful policies without prior knowledge of the dynamics of the task. However, obtaining successful policies required thousands to millions of training episodes, limiting the applicability of these approaches to real hardware. We developed a training procedure that allows us to use a simple custom simulator to learn policies robust to the mismatch of simulation vs robot. In our experiments, we demonstrate that the policy learned in the simulator is able to pivot the object to the desired target angle on the real robot. We also show generalization to an object with different inertia, shape, mass and friction properties than those used during training. This result is a step towards making model-free reinforcement…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Advanced Control Systems Optimization · Adaptive Dynamic Programming Control
