Deep EndoVO: A Recurrent Convolutional Neural Network (RCNN) based   Visual Odometry Approach for Endoscopic Capsule Robots

Mehmet Turan; Yasin Almalioglu; Helder Araujo; Ender Konukoglu; Metin; Sitti

arXiv:1708.06822·cs.CV·November 21, 2017

Deep EndoVO: A Recurrent Convolutional Neural Network (RCNN) based Visual Odometry Approach for Endoscopic Capsule Robots

Mehmet Turan, Yasin Almalioglu, Helder Araujo, Ender Konukoglu, Metin, Sitti

PDF

TL;DR

This paper introduces Deep EndoVO, a deep RCNN-based visual odometry method for endoscopic capsule robots, enabling accurate real-time pose estimation crucial for active endoscopic navigation.

Contribution

It presents a novel deep RCNN approach combining CNNs and RNNs for monocular visual odometry in endoscopic capsules, improving accuracy over existing methods.

Findings

01

Achieves high translational accuracy in pig stomach tests

02

Demonstrates precise rotational pose estimation

03

Validates effectiveness on real endoscopic data

Abstract

Ingestible wireless capsule endoscopy is an emerging minimally invasive diagnostic technology for inspection of the GI tract and diagnosis of a wide range of diseases and pathologies. Medical device companies and many research groups have recently made substantial progresses in converting passive capsule endoscopes to active capsule robots, enabling more accurate, precise, and intuitive detection of the location and size of the diseased areas. Since a reliable real time pose estimation functionality is crucial for actively controlled endoscopic capsule robots, in this study, we propose a monocular visual odometry (VO) method for endoscopic capsule robot operations. Our method lies on the application of the deep Recurrent Convolutional Neural Networks (RCNNs) for the visual odometry task, where Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs) are used for the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.