Incorporating Learnt Local and Global Embeddings into Monocular Visual   SLAM

Huaiyang Huang; Haoyang Ye; Yuxiang Sun; Lujia Wang; Ming Liu

arXiv:2108.02028·cs.RO·August 5, 2021

Incorporating Learnt Local and Global Embeddings into Monocular Visual SLAM

Huaiyang Huang, Haoyang Ye, Yuxiang Sun, Lujia Wang, Ming Liu

PDF

Open Access

TL;DR

This paper introduces a monocular VSLAM system that integrates learned local features and global embeddings to enhance robustness and accuracy, especially under challenging conditions like varying illumination.

Contribution

It presents a novel VSLAM system that fully exploits learned features and global embeddings at multiple modules, improving robustness and accuracy over traditional methods.

Findings

01

Outperforms state-of-the-art methods on public datasets

02

Enhances robustness in challenging lighting conditions

03

Achieves competitive camera pose estimation accuracy

Abstract

Traditional approaches for Visual Simultaneous Localization and Mapping (VSLAM) rely on low-level vision information for state estimation, such as handcrafted local features or the image gradient. While significant progress has been made through this track, under more challenging configuration for monocular VSLAM, e.g., varying illumination, the performance of state-of-the-art systems generally degrades. As a consequence, robustness and accuracy for monocular VSLAM are still widely concerned. This paper presents a monocular VSLAM system that fully exploits learnt features for better state estimation. The proposed system leverages both learnt local features and global embeddings at different modules of the system: direct camera pose estimation, inter-frame feature association, and loop closure detection. With a probabilistic explanation of keypoint prediction, we formulate the camera…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · Advanced Vision and Imaging