Incorporating Learnt Local and Global Embeddings into Monocular Visual SLAM
Huaiyang Huang, Haoyang Ye, Yuxiang Sun, Lujia Wang, Ming Liu

TL;DR
This paper introduces a monocular VSLAM system that integrates learned local features and global embeddings to enhance robustness and accuracy, especially under challenging conditions like varying illumination.
Contribution
It presents a novel VSLAM system that fully exploits learned features and global embeddings at multiple modules, improving robustness and accuracy over traditional methods.
Findings
Outperforms state-of-the-art methods on public datasets
Enhances robustness in challenging lighting conditions
Achieves competitive camera pose estimation accuracy
Abstract
Traditional approaches for Visual Simultaneous Localization and Mapping (VSLAM) rely on low-level vision information for state estimation, such as handcrafted local features or the image gradient. While significant progress has been made through this track, under more challenging configuration for monocular VSLAM, e.g., varying illumination, the performance of state-of-the-art systems generally degrades. As a consequence, robustness and accuracy for monocular VSLAM are still widely concerned. This paper presents a monocular VSLAM system that fully exploits learnt features for better state estimation. The proposed system leverages both learnt local features and global embeddings at different modules of the system: direct camera pose estimation, inter-frame feature association, and loop closure detection. With a probabilistic explanation of keypoint prediction, we formulate the camera…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · Advanced Vision and Imaging
