A Deep Learning Based 6 Degree-of-Freedom Localization Method for Endoscopic Capsule Robots
Mehmet Turan, Yasin Almalioglu, Ender Konukoglu, Metin Sitti

TL;DR
This paper introduces a deep learning-based 6-DoF localization system for endoscopic capsule robots, achieving real-time pose estimation using only visual data with high accuracy and robustness in complex GI tract environments.
Contribution
A novel 23-layer CNN architecture for real-time 6-DoF localization of capsule robots using monocular endoscopic images, trained on realistic stomach model data.
Findings
Average translation error of 7.1%
Average rotation error of 3.4%
Robust performance under challenging visual conditions
Abstract
We present a robust deep learning based 6 degrees-of-freedom (DoF) localization system for endoscopic capsule robots. Our system mainly focuses on localization of endoscopic capsule robots inside the GI tract using only visual information captured by a mono camera integrated to the robot. The proposed system is a 23-layer deep convolutional neural network (CNN) that is capable to estimate the pose of the robot in real time using a standard CPU. The dataset for the evaluation of the system was recorded inside a surgical human stomach model with realistic surface texture, softness, and surface liquid properties so that the pre-trained CNN architecture can be transferred confidently into a real endoscopic scenario. An average error of 7:1% and 3:4% for translation and rotation has been obtained, respectively. The results accomplished from the experiments demonstrate that a CNN pre-trained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGastrointestinal Bleeding Diagnosis and Treatment · Soft Robotics and Applications · Surgical Simulation and Training
