Deep EndoVO: A Recurrent Convolutional Neural Network (RCNN) based Visual Odometry Approach for Endoscopic Capsule Robots
Mehmet Turan, Yasin Almalioglu, Helder Araujo, Ender Konukoglu, Metin, Sitti

TL;DR
This paper introduces Deep EndoVO, a deep RCNN-based visual odometry method for endoscopic capsule robots, enabling accurate real-time pose estimation crucial for active endoscopic navigation.
Contribution
It presents a novel deep RCNN approach combining CNNs and RNNs for monocular visual odometry in endoscopic capsules, improving accuracy over existing methods.
Findings
Achieves high translational accuracy in pig stomach tests
Demonstrates precise rotational pose estimation
Validates effectiveness on real endoscopic data
Abstract
Ingestible wireless capsule endoscopy is an emerging minimally invasive diagnostic technology for inspection of the GI tract and diagnosis of a wide range of diseases and pathologies. Medical device companies and many research groups have recently made substantial progresses in converting passive capsule endoscopes to active capsule robots, enabling more accurate, precise, and intuitive detection of the location and size of the diseased areas. Since a reliable real time pose estimation functionality is crucial for actively controlled endoscopic capsule robots, in this study, we propose a monocular visual odometry (VO) method for endoscopic capsule robot operations. Our method lies on the application of the deep Recurrent Convolutional Neural Networks (RCNNs) for the visual odometry task, where Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs) are used for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
