Fast Direct Stereo Visual SLAM
Jiawei Mo, Md Jahidul Islam, and Junaed Sattar

TL;DR
This paper introduces a fast, robust stereo visual SLAM system that avoids feature detection and matching by extending direct sparse odometry to stereo, incorporating loop closure, and optimizing for accuracy and efficiency.
Contribution
It presents a novel stereo SLAM method based on direct sparse odometry, with loop closure and LiDAR-inspired place recognition, achieving high efficiency without feature matching.
Findings
Outperforms state-of-the-art methods in accuracy and speed
Demonstrates robustness in various datasets
Efficient loop closure detection and pose optimization
Abstract
We propose a novel approach for fast and accurate stereo visual Simultaneous Localization and Mapping (SLAM) independent of feature detection and matching. We extend monocular Direct Sparse Odometry (DSO) to a stereo system by optimizing the scale of the 3D points to minimize photometric error for the stereo configuration, which yields a computationally efficient and robust method compared to conventional stereo matching. We further extend it to a full SLAM system with loop closure to reduce accumulated errors. With the assumption of forward camera motion, we imitate a LiDAR scan using the 3D points obtained from the visual odometry and adapt a LiDAR descriptor for place recognition to facilitate more efficient detection of loop closures. Afterward, we estimate the relative pose using direct alignment by minimizing the photometric error for potential loop closures. Optionally, further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · Advanced Image and Video Retrieval Techniques
