Learning Pivoting Manipulation with Force and Vision Feedback Using Optimization-based Demonstrations
Yuki Shirai, Kei Ota, Devesh K. Jha, Diego Romeres

TL;DR
This paper introduces a hybrid learning framework combining model-based optimization and deep reinforcement learning for efficient, robust pivoting manipulation using force and vision feedback, with successful sim-to-real transfer.
Contribution
It proposes a novel demonstration-guided RL approach leveraging CITO for sample efficiency and a privileged training strategy for sim-to-real transfer without privileged info.
Findings
Successful pivoting tasks in simulation and real-world
Sample-efficient learning with demonstration guidance
Effective sim-to-real transfer using proprioception, vision, and force sensing
Abstract
Non-prehensile manipulation is challenging due to complex contact interactions between objects, the environment, and robots. Model-based approaches can efficiently generate complex trajectories of robots and objects under contact constraints. However, they tend to be sensitive to model inaccuracies and require access to privileged information (e.g., object mass, size, pose), making them less suitable for novel objects. In contrast, learning-based approaches are typically more robust to modeling errors but require large amounts of data. In this paper, we bridge these two approaches to propose a framework for learning closed-loop pivoting manipulation. By leveraging computationally efficient Contact-Implicit Trajectory Optimization (CITO), we design demonstration-guided deep Reinforcement Learning (RL), leading to sample-efficient learning. We also present a sim-to-real transfer approach…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
