DeepRelativeFusion: Dense Monocular SLAM using Single-Image Relative Depth Prediction
Shing Yan Loo, Syamsiah Mashohor, Sai Hong Tang, Hong Zhang

TL;DR
DeepRelativeFusion is a dense monocular SLAM system that integrates relative depth prediction and adaptive filtering to improve 3D scene reconstruction accuracy, outperforming existing methods.
Contribution
It introduces a novel densification approach using single-image relative depth prediction and an adaptive filtering scheme within a SLAM framework.
Findings
Significantly improves dense reconstruction accuracy.
Outperforms state-of-the-art dense SLAM systems.
Provides a feedback loop for pose refinement.
Abstract
In this paper, we propose a dense monocular SLAM system, named DeepRelativeFusion, that is capable to recover a globally consistent 3D structure. To this end, we use a visual SLAM algorithm to reliably recover the camera poses and semi-dense depth maps of the keyframes, and then use relative depth prediction to densify the semi-dense depth maps and refine the keyframe pose-graph. To improve the semi-dense depth maps, we propose an adaptive filtering scheme, which is a structure-preserving weighted average smoothing filter that takes into account the pixel intensity and depth of the neighbouring pixels, yielding substantial reconstruction accuracy gain in densification. To perform densification, we introduce two incremental improvements upon the energy minimization framework proposed by DeepFusion: (1) an improved cost function, and (2) the use of single-image relative depth prediction.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · Optical measurement and interference techniques
