OnlineSplatter: Pose-Free Online 3D Reconstruction for Free-Moving Objects
Mark He Huang, Lin Geng Foo, Christian Theobalt, Ying Sun, De Wen Soh

TL;DR
OnlineSplatter is a real-time, pose-free 3D reconstruction method for free-moving objects from monocular video, using a novel memory module to achieve high-quality results without pose or depth information.
Contribution
It introduces a dual-key memory module and a dense Gaussian primitive field for online, pose-free 3D reconstruction that maintains constant computational cost.
Findings
Outperforms state-of-the-art pose-free reconstruction methods
Maintains constant memory and runtime regardless of sequence length
Improves reconstruction quality with more observations
Abstract
Free-moving object reconstruction from monocular video remains challenging, particularly without reliable pose or depth cues and under arbitrary object motion. We introduce OnlineSplatter, a novel online feed-forward framework generating high-quality, object-centric 3D Gaussians directly from RGB frames without requiring camera pose, depth priors, or bundle optimization. Our approach anchors reconstruction using the first frame and progressively refines the object representation through a dense Gaussian primitive field, maintaining constant computational cost regardless of video sequence length. Our core contribution is a dual-key memory module combining latent appearance-geometry keys with explicit directional keys, robustly fusing current frame features with temporally aggregated object states. This design enables effective handling of free-moving objects via spatial-guided memory…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsHuman Pose and Action Recognition · Advanced Vision and Imaging · Robotics and Sensor-Based Localization
