ParticleFormer: A 3D Point Cloud World Model for Multi-Object, Multi-Material Robotic Manipulation

Suning Huang; Qianzhong Chen; Xiaohan Zhang; Jiankai Sun; Mac Schwager

arXiv:2506.23126·cs.RO·August 27, 2025

ParticleFormer: A 3D Point Cloud World Model for Multi-Object, Multi-Material Robotic Manipulation

Suning Huang, Qianzhong Chen, Xiaohan Zhang, Jiankai Sun, Mac Schwager

PDF

Open Access

TL;DR

ParticleFormer is a Transformer-based 3D point cloud world model that effectively captures multi-object, multi-material interactions directly from real-world data, enabling improved robotic manipulation and dynamics prediction without complex scene reconstruction.

Contribution

This work introduces ParticleFormer, a novel Transformer-based point cloud model that learns multi-material, multi-object dynamics directly from real-world data, surpassing existing single-material models.

Findings

01

Outperforms baselines in dynamics prediction accuracy

02

Achieves less rollout error in visuomotor tasks

03

Validated on six simulation and three real-world experiments

Abstract

3D world models (i.e., learning-based 3D dynamics models) offer a promising approach to generalizable robotic manipulation by capturing the underlying physics of environment evolution conditioned on robot actions. However, existing 3D world models are primarily limited to single-material dynamics using a particle-based Graph Neural Network model, and often require time-consuming 3D scene reconstruction to obtain 3D particle tracks for training. In this work, we present ParticleFormer, a Transformer-based point cloud world model trained with a hybrid point cloud reconstruction loss, supervising both global and local dynamics features in multi-material, multi-object robot interactions. ParticleFormer captures fine-grained multi-object interactions between rigid, deformable, and flexible materials, trained directly from real-world robot perception data without an elaborate scene…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Model Reduction and Neural Networks · Reinforcement Learning in Robotics