TL;DR
ORB-SLAM3 is a versatile, accurate, open-source SLAM system capable of visual, visual-inertial, and multi-map operation across various camera types and environments, with robust long-term mapping and reuse of previous information.
Contribution
It introduces a feature-based tightly-integrated visual-inertial SLAM system relying on MAP estimation and a novel multi-map approach with improved place recognition, enabling long-term robustness and high accuracy.
Findings
Operates robustly in real-time across diverse environments.
Achieves 2 to 5 times higher accuracy than previous methods.
Successfully handles long-term mapping with map merging and reuse.
Abstract
This paper presents ORB-SLAM3, the first system able to perform visual, visual-inertial and multi-map SLAM with monocular, stereo and RGB-D cameras, using pin-hole and fisheye lens models. The first main novelty is a feature-based tightly-integrated visual-inertial SLAM system that fully relies on Maximum-a-Posteriori (MAP) estimation, even during the IMU initialization phase. The result is a system that operates robustly in real-time, in small and large, indoor and outdoor environments, and is 2 to 5 times more accurate than previous approaches. The second main novelty is a multiple map system that relies on a new place recognition method with improved recall. Thanks to it, ORB-SLAM3 is able to survive to long periods of poor visual information: when it gets lost, it starts a new map that will be seamlessly merged with previous maps when revisiting mapped areas. Compared with visual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
