Gaussian Splatting Visual MPC for Granular Media Manipulation
Wei-Cheng Tseng, Ellina Zhang, Krishna Murthy Jatavallabhula and, Florian Shkurti

TL;DR
This paper introduces a Gaussian splatting-based visual dynamics model for granular media manipulation, enabling efficient planning and generalization in complex, particle-based environments using Model-Predictive Control.
Contribution
It presents a novel visual dynamics modeling approach over Gaussian splatting representations specifically designed for granular media, improving manipulation and generalization capabilities.
Findings
Effective in simulated and real-world granular manipulation tasks
Outperforms existing methods in prediction accuracy and manipulation success
Demonstrates zero-shot transfer to new environments
Abstract
Recent advancements in learned 3D representations have enabled significant progress in solving complex robotic manipulation tasks, particularly for rigid-body objects. However, manipulating granular materials such as beans, nuts, and rice, remains challenging due to the intricate physics of particle interactions, high-dimensional and partially observable state, inability to visually track individual particles in a pile, and the computational demands of accurate dynamics prediction. Current deep latent dynamics models often struggle to generalize in granular material manipulation due to a lack of inductive biases. In this work, we propose a novel approach that learns a visual dynamics model over Gaussian splatting representations of scenes and leverages this model for manipulating granular media via Model-Predictive Control. Our method enables efficient optimization for complex…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputer Graphics and Visualization Techniques · Robotics and Sensor-Based Localization · Advanced Vision and Imaging
