CudaSIFT-SLAM: multiple-map visual SLAM for full procedure mapping in real human endoscopy
Richard Elvira, Juan D. Tard\'os, Jos\'e M.M. Montiel

TL;DR
This paper introduces CudaSIFT-SLAM, a real-time visual SLAM system for human endoscopy that overcomes previous limitations by using GPU-accelerated SIFT features, enabling effective map merging and relocalization in challenging colonoscopy environments.
Contribution
The novel system employs SIFT features and brute-force matching with GPU acceleration to enable real-time, full-procedure mapping in human colonoscopies, surpassing prior methods like ORB-SLAM3.
Findings
Successfully maps 88% of frames in phantom datasets
Achieves 70% improvement in mapping coverage over ORB-SLAM3 in real colonoscopy
Enables map merging and relocalization in challenging endoscopy conditions
Abstract
Monocular visual simultaneous localization and mapping (V-SLAM) is nowadays an irreplaceable tool in mobile robotics and augmented reality, where it performs robustly. However, human colonoscopies pose formidable challenges like occlusions, blur, light changes, lack of texture, deformation, water jets or tool interaction, which result in very frequent tracking losses. ORB-SLAM3, the top performing multiple-map V-SLAM, is unable to recover from them by merging sub-maps or relocalizing the camera, due to the poor performance of its place recognition algorithm based on ORB features and DBoW2 bag-of-words. We present CudaSIFT-SLAM, the first V-SLAM system able to process complete human colonoscopies in real-time. To overcome the limitations of ORB-SLAM3, we use SIFT instead of ORB features and replace the DBoW2 direct index with the more computationally demanding brute-force matching,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Robotics and Sensor-Based Localization · Augmented Reality Applications
