DeformGS: Scene Flow in Highly Deformable Scenes for Deformable Object   Manipulation

Bardienus P. Duisterhof; Zhao Mandi; Yunchao Yao; Jia-Wei Liu; Jenny; Seidenschwarz; Mike Zheng Shou; Deva Ramanan; Shuran Song; Stan Birchfield,; Bowen Wen; Jeffrey Ichnowski

arXiv:2312.00583·cs.CV·September 2, 2024·6 cites

DeformGS: Scene Flow in Highly Deformable Scenes for Deformable Object Manipulation

Bardienus P. Duisterhof, Zhao Mandi, Yunchao Yao, Jia-Wei Liu, Jenny, Seidenschwarz, Mike Zheng Shou, Deva Ramanan, Shuran Song, Stan Birchfield,, Bowen Wen, Jeffrey Ichnowski

PDF

Open Access

TL;DR

DeformGS is a novel method for dense 3D scene flow estimation in highly deformable scenes, enabling improved robotic manipulation of objects like cloth by leveraging multi-view video, Gaussian splatting, and physics-inspired regularization.

Contribution

It introduces DeformGS, a new approach combining Gaussian splatting, neural-voxel encoding, and physics-based regularization for accurate scene flow in deformable objects.

Findings

01

DeformGS improves 3D tracking accuracy by 55.8% over previous methods.

02

Achieves a median tracking error of 3.3 mm on a large cloth scene.

03

Performs well even with shadows and occlusions in complex scenes.

Abstract

Teaching robots to fold, drape, or reposition deformable objects such as cloth will unlock a variety of automation applications. While remarkable progress has been made for rigid object manipulation, manipulating deformable objects poses unique challenges, including frequent occlusions, infinite-dimensional state spaces and complex dynamics. Just as object pose estimation and tracking have aided robots for rigid manipulation, dense 3D tracking (scene flow) of highly deformable objects will enable new applications in robotics while aiding existing approaches, such as imitation learning or creating digital twins with real2sim transfer. We propose DeformGS, an approach to recover scene flow in highly deformable scenes, using simultaneous video captures of a dynamic scene from multiple cameras. DeformGS builds on recent advances in Gaussian splatting, a method that learns the properties of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · 3D Shape Modeling and Analysis · Human Pose and Action Recognition

MethodsSparse Evolutionary Training · Segment Anything Model