INGeo: Accelerating Instant Neural Scene Reconstruction with Noisy Geometry Priors
Chaojian Li, Bichen Wu, Albert Pumarola, Peizhao Zhang, Yingyan Celine, Lin, and Peter Vajda

TL;DR
This paper introduces INGeo, a method that leverages noisy geometry priors to significantly accelerate 3D scene reconstruction, aiming for instant training on edge devices like mobile phones and AR/VR headsets.
Contribution
It proposes strategies to mitigate noise in geometry priors, enabling faster training of neural scene reconstruction models on resource-constrained devices.
Findings
Achieves >30 PSNR with half the training iterations on NeRF Synthetic dataset.
Accelerates training speed beyond existing methods on edge devices.
Effectively handles noisy geometry priors to improve reconstruction speed.
Abstract
We present a method that accelerates reconstruction of 3D scenes and objects, aiming to enable instant reconstruction on edge devices such as mobile phones and AR/VR headsets. While recent works have accelerated scene reconstruction training to minute/second-level on high-end GPUs, there is still a large gap to the goal of instant training on edge devices which is yet highly desired in many emerging applications such as immersive AR/VR. To this end, this work aims to further accelerate training by leveraging geometry priors of the target scene. Our method proposes strategies to alleviate the noise of the imperfect geometry priors to accelerate the training speed on top of the highly optimized Instant-NGP. On the NeRF Synthetic dataset, our work uses half of the training iterations to reach an average test PSNR of >30.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · 3D Shape Modeling and Analysis · Human Pose and Action Recognition
MethodsTest · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
