RoboCraft: Learning to See, Simulate, and Shape Elasto-Plastic Objects   with Graph Networks

Haochen Shi; Huazhe Xu; Zhiao Huang; Yunzhu Li; Jiajun Wu

arXiv:2205.02909·cs.RO·May 9, 2022

RoboCraft: Learning to See, Simulate, and Shape Elasto-Plastic Objects with Graph Networks

Haochen Shi, Huazhe Xu, Zhiao Huang, Yunzhu Li, Jiajun Wu

PDF

Open Access

TL;DR

RoboCraft introduces a particle-based, graph neural network-driven system enabling robots to learn and manipulate complex elasto-plastic objects from minimal visual data, achieving generalization and human-level performance.

Contribution

It presents a novel particle-based representation and GNN-based dynamics model for elasto-plastic objects, integrated with model-predictive control for robotic manipulation.

Findings

01

Learned models enable shape deformation from minimal data

02

System generalizes to unseen shapes and complex actions

03

Performance matches or exceeds human manipulation in tests

Abstract

Modeling and manipulating elasto-plastic objects are essential capabilities for robots to perform complex industrial and household interaction tasks (e.g., stuffing dumplings, rolling sushi, and making pottery). However, due to the high degree of freedom of elasto-plastic objects, significant challenges exist in virtually every aspect of the robotic manipulation pipeline, e.g., representing the states, modeling the dynamics, and synthesizing the control signals. We propose to tackle these challenges by employing a particle-based representation for elasto-plastic objects in a model-based planning framework. Our system, RoboCraft, only assumes access to raw RGBD visual observations. It transforms the sensing data into particles and learns a particle-based dynamics model using graph neural networks (GNNs) to capture the structure of the underlying system. The learned model can then be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Multimodal Machine Learning Applications · Human Pose and Action Recognition