BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects
Bowen Wen, Jonathan Tremblay, Valts Blukis, Stephen Tyree, Thomas, Muller, Alex Evans, Dieter Fox, Jan Kautz, Stan Birchfield

TL;DR
BundleSDF introduces a near real-time neural method for 6-DoF tracking and 3D reconstruction of unknown objects from monocular RGBD videos, effective even with untextured surfaces and occlusions.
Contribution
It is the first to combine neural 3D reconstruction with 6-DoF tracking for arbitrary objects using only initial segmentation and no prior info.
Findings
Outperforms existing methods on multiple datasets
Handles large pose changes and occlusions effectively
Works with untextured and reflective surfaces
Abstract
We present a near real-time method for 6-DoF tracking of an unknown object from a monocular RGBD video sequence, while simultaneously performing neural 3D reconstruction of the object. Our method works for arbitrary rigid objects, even when visual texture is largely absent. The object is assumed to be segmented in the first frame only. No additional information is required, and no assumption is made about the interaction agent. Key to our method is a Neural Object Field that is learned concurrently with a pose graph optimization process in order to robustly accumulate information into a consistent 3D representation capturing both geometry and appearance. A dynamic pool of posed memory frames is automatically maintained to facilitate communication between these threads. Our approach handles challenging sequences with large pose changes, partial and full occlusion, untextured surfaces,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Advanced Neural Network Applications · Robotics and Sensor-Based Localization
