4DGS360: 360{\deg} Gaussian Reconstruction of Dynamic Objects from a Single Video
Jae Won Jang, Yeonjin Chang, Wonsik Shin, Juhwan Cho, and Nojun Kwak

TL;DR
4DGS360 is a novel framework for reconstructing 360-degree dynamic objects from monocular videos, overcoming geometric ambiguities with advanced 3D initialization and tracking, and demonstrating superior performance on new and existing datasets.
Contribution
The paper introduces 4DGS360, a diffusion-free method with a new 3D-native initialization and AnchorTAP3D tracker for accurate 360-degree dynamic object reconstruction from monocular videos.
Findings
Achieves state-of-the-art results on multiple datasets.
Effectively handles occlusions and geometric ambiguities.
Introduces iPhone360 benchmark for 360-degree evaluation.
Abstract
We introduce 4DGS360, a diffusion-free framework for 360 dynamic object reconstruction from casual monocular video. Existing methods often fail to reconstruct consistent 360 geometry, as their heavy reliance on 2D-native priors causes initial points to overfit to visible surface in each training view. 4DGS360 addresses this challenge through a advanced 3D-native initialization that mitigates the geometric ambiguity of occluded regions. Our proposed 3D tracker, AnchorTAP3D, produces reinforced 3D point trajectories by leveraging confident 2D track points as anchors, suppressing drift and providing reliable initialization that preserves geometry in occluded regions. This initialization, combined with optimization, yields coherent 360 4D reconstructions. We further present iPhone360, a new benchmark where test cameras are placed up to 135 apart from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Face recognition and analysis · Robotics and Sensor-Based Localization
