BodySLAM: A Generalized Monocular Visual SLAM Framework for Surgical Applications
G. Manni (1, 2), C. Lauretti (2), F. Prata (3), R. Papalia (3), L., Zollo (2), P. Soda (1) ((1) Research Unit of Computer Systems and, Bioinformatics Department of Engineering Universit\`a Campus Bio-Medico di, Roma, (2) Unit of Advanced Robotics

TL;DR
BodySLAM introduces a deep learning-based monocular SLAM framework tailored for endoscopic surgery, combining novel pose estimation, depth prediction, and 3D mapping to improve surgical navigation without additional hardware.
Contribution
It presents a comprehensive MVSLAM system with novel components like CycleVO and integrates state-of-the-art depth estimation, addressing hardware limitations in endoscopic procedures.
Findings
CycleVO achieved low inference time and robust generalization.
Zoe outperformed existing depth estimation algorithms.
BodySLAM demonstrated strong performance across diverse endoscopic datasets.
Abstract
Endoscopic surgery relies on two-dimensional views, posing challenges for surgeons in depth perception and instrument manipulation. While Monocular Visual Simultaneous Localization and Mapping (MVSLAM) has emerged as a promising solution, its implementation in endoscopic procedures faces significant challenges due to hardware limitations, such as the use of a monocular camera and the absence of odometry sensors. This study presents BodySLAM, a robust deep learning-based MVSLAM approach that addresses these challenges through three key components: CycleVO, a novel unsupervised monocular pose estimation module; the integration of the state-of-the-art Zoe architecture for monocular depth estimation; and a 3D reconstruction module creating a coherent surgical map. The approach is rigorously evaluated using three publicly available datasets (Hamlyn, EndoSLAM, and SCARED) spanning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAugmented Reality Applications · Surgical Simulation and Training · Anatomy and Medical Technology
MethodsHuMan(Expedia)||How do I get a human at Expedia? · *Communicated@Fast*How Do I Communicate to Expedia? · Instance Normalization · Batch Normalization · PatchGAN · Residual Connection · Sigmoid Activation · Cycle Consistency Loss · GAN Least Squares Loss · Residual Block
