Self-Adapting Recurrent Models for Object Pushing from Learning in Simulation
Lin Cong, Michael G\"orner, Philipp Ruppel, Hongzhuo Liang, Norman, Hendrich, Jianwei Zhang

TL;DR
This paper introduces a self-adapting recurrent model for object pushing that is trained in simulation and adapts quickly to real-world dynamics, improving robotic pushing performance with minimal real data.
Contribution
It presents a novel recurrent model predictive control framework that adapts to real object dynamics using simulation-trained models and domain randomization.
Findings
The model adapts to real object dynamics within a few steps.
The proposed RMPPI algorithm improves path planning with recurrent models.
Experiments show effective pushing on a UR5 robot platform.
Abstract
Planar pushing remains a challenging research topic, where building the dynamic model of the interaction is the core issue. Even an accurate analytical dynamic model is inherently unstable because physics parameters such as inertia and friction can only be approximated. Data-driven models usually rely on large amounts of training data, but data collection is time consuming when working with real robots. In this paper, we collect all training data in a physics simulator and build an LSTM-based model to fit the pushing dynamics. Domain Randomization is applied to capture the pushing trajectories of a generalized class of objects. When executed on the real robot, the trained recursive model adapts to the tracked object's real dynamics within a few steps. We propose the algorithm \emph{Recurrent} Model Predictive Path Integral (RMPPI) as a variation of the original MPPI approach,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Robotic Path Planning Algorithms
