GAF: Gaussian Action Field as a 4D Representation for Dynamic World Modeling in Robotic Manipulation
Ying Chai, Litao Deng, Ruizhi Shao, Jiajun Zhang, Kangchen Lv, Liangjun Xing, Xiang Li, Hongwen Zhang, Yebin Liu

TL;DR
This paper introduces GAF, a 4D scene representation that models dynamic scenes and actions for robotic manipulation, improving accuracy and success rates over existing methods.
Contribution
It proposes the Gaussian Action Field (GAF), a novel 4D representation that incorporates motion attributes for direct action reasoning in dynamic scenes.
Findings
GAF improves scene reconstruction quality significantly.
GAF boosts robotic manipulation success rate by 7.3%.
The method outperforms state-of-the-art in dynamic scene modeling.
Abstract
Accurate scene perception is critical for vision-based robotic manipulation. Existing approaches typically follow either a Vision-to-Action (V-A) paradigm, predicting actions directly from visual inputs, or a Vision-to-3D-to-Action (V-3D-A) paradigm, leveraging intermediate 3D representations. However, these methods often struggle with action inaccuracies due to the complexity and dynamic nature of manipulation scenes. In this paper, we adopt a V-4D-A framework that enables direct action reasoning from motion-aware 4D representations via a Gaussian Action Field (GAF). GAF extends 3D Gaussian Splatting (3DGS) by incorporating learnable motion attributes, allowing 4D modeling of dynamic scenes and manipulation actions. To learn time-varying scene geometry and action-aware robot motion, GAF provides three interrelated outputs: reconstruction of the current scene, prediction of future…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing and 3D Reconstruction · Robot Manipulation and Learning · Simulation Techniques and Applications
MethodsDiffusion
