VIP-SLAM: An Efficient Tightly-Coupled RGB-D Visual Inertial Planar SLAM

Danpeng Chen; Shuai Wang; Weijian Xie; Shangjin Zhai; Nan Wang; Hujun; Bao; Guofeng Zhang

arXiv:2207.01158·cs.RO·July 5, 2022

VIP-SLAM: An Efficient Tightly-Coupled RGB-D Visual Inertial Planar SLAM

Danpeng Chen, Shuai Wang, Weijian Xie, Shangjin Zhai, Nan Wang, Hujun, Bao, Guofeng Zhang

PDF

Open Access

TL;DR

VIP-SLAM introduces a lightweight, tightly-coupled RGB-D visual inertial SLAM system that leverages planar structures to significantly reduce computational complexity while maintaining high accuracy, suitable for mobile devices.

Contribution

The paper presents a novel SLAM system that integrates plane information to simplify the map and accelerate bundle adjustment without sacrificing accuracy.

Findings

01

Global bundle adjustment is nearly 2 times faster.

02

The system achieves better accuracy and efficiency compared to traditional sparse point-based SLAM.

03

Effective use of plane structures reduces computational complexity in indoor environments.

Abstract

In this paper, we propose a tightly-coupled SLAM system fused with RGB, Depth, IMU and structured plane information. Traditional sparse points based SLAM systems always maintain a mass of map points to model the environment. Huge number of map points bring us a high computational complexity, making it difficult to be deployed on mobile devices. On the other hand, planes are common structures in man-made environment especially in indoor environments. We usually can use a small number of planes to represent a large scene. So the main purpose of this article is to decrease the high complexity of sparse points based SLAM. We build a lightweight back-end map which consists of a few planes and map points to achieve efficient bundle adjustment (BA) with an equal or better accuracy. We use homography constraints to eliminate the parameters of numerous plane points in the optimization and reduce…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · Advanced Image and Video Retrieval Techniques

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings