BundleFusion: Real-time Globally Consistent 3D Reconstruction using On-the-fly Surface Re-integration
Angela Dai, Matthias Nie{\ss}ner, Michael Zollh\"ofer and, Shahram Izadi, Christian Theobalt

TL;DR
BundleFusion introduces a real-time, globally consistent 3D reconstruction system that significantly improves accuracy, robustness, and speed over previous methods, enabling high-quality large-scale scene scanning in real-time.
Contribution
It presents a novel end-to-end framework with a hierarchical pose optimization, relocalization, and real-time model re-estimation, surpassing prior online methods in quality and speed.
Findings
Achieves real-time globally optimized pose estimation.
Supports robust relocalization after tracking failures.
Produces high-quality, complete 3D scans comparable to offline methods.
Abstract
Real-time, high-quality, 3D scanning of large-scale scenes is key to mixed reality and robotic applications. However, scalability brings challenges of drift in pose estimation, introducing significant errors in the accumulated model. Approaches often require hours of offline processing to globally correct model errors. Recent online methods demonstrate compelling results, but suffer from: (1) needing minutes to perform online correction preventing true real-time use; (2) brittle frame-to-frame (or frame-to-model) pose estimation resulting in many tracking failures; or (3) supporting only unstructured point-based representations, which limit scan quality and applicability. We systematically address these issues with a novel, real-time, end-to-end reconstruction framework. At its core is a robust pose estimation strategy, optimizing per frame for a global set of camera poses by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · 3D Surveying and Cultural Heritage · Augmented Reality Applications
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
