CNN-Augmented Visual-Inertial SLAM with Planar Constraints
Pan Ji, Yuan Tian, Qingan Yan, Yuxin Ma, and Yi Xu

TL;DR
This paper introduces a robust visual-inertial SLAM system that integrates CNN-based depth prediction, uncertainty weighting, and planar constraints to enhance localization accuracy and robustness.
Contribution
It combines CNN-predicted depth and uncertainty with planar constraints for improved SLAM performance, including a novel fast plane detection method.
Findings
Improved SLAM accuracy on EuRoC dataset
Effective integration of CNN depth and uncertainty
Enhanced plane detection for SLAM regularization
Abstract
We present a robust visual-inertial SLAM system that combines the benefits of Convolutional Neural Networks (CNNs) and planar constraints. Our system leverages a CNN to predict the depth map and the corresponding uncertainty map for each image. The CNN depth effectively bootstraps the back-end optimization of SLAM and meanwhile the CNN uncertainty adaptively weighs the contribution of each feature point to the back-end optimization. Given the gravity direction from the inertial sensor, we further present a fast plane detection method that detects horizontal planes via one-point RANSAC and vertical planes via two-point RANSAC. Those stably detected planes are in turn used to regularize the back-end optimization of SLAM. We evaluate our system on a public dataset, \ie, EuRoC, and demonstrate improved results over a state-of-the-art SLAM system, \ie, ORB-SLAM3.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · Advanced Image and Video Retrieval Techniques
MethodsGravity
