V$^2$-SfMLearner: Learning Monocular Depth and Ego-motion for Multimodal Wireless Capsule Endoscopy
Long Bai, Beilei Cui, Liangyu Wang, Yanheng Li, Shilong Yao, Sishen, Yuan, Yanan Wu, Yang Zhang, Max Q.-H. Meng, Zhen Li, Weiping Ding, Hongliang, Ren

TL;DR
V$^2$-SfMLearner is a novel multimodal deep learning approach that integrates vibration signals with visual data to improve depth and ego-motion estimation in monocular capsule endoscopy, enhancing robustness and clinical applicability.
Contribution
It introduces a multimodal unsupervised learning framework incorporating vibration signals, with a specialized vibration network and Fourier fusion module, to improve depth and motion estimation in capsule endoscopy.
Findings
Outperforms vision-only algorithms in accuracy and robustness.
Effectively mitigates vibration noise through multimodal fusion.
Compatible with existing vision-based methods for clinical use.
Abstract
Deep learning can predict depth maps and capsule ego-motion from capsule endoscopy videos, aiding in 3D scene reconstruction and lesion localization. However, the collisions of the capsule endoscopies within the gastrointestinal tract cause vibration perturbations in the training data. Existing solutions focus solely on vision-based processing, neglecting other auxiliary signals like vibrations that could reduce noise and improve performance. Therefore, we propose V-SfMLearner, a multimodal approach integrating vibration signals into vision-based depth and capsule motion estimation for monocular capsule endoscopy. We construct a multimodal capsule endoscopy dataset containing vibration and visual signals, and our artificial intelligence solution develops an unsupervised method using vision-vibration signals, effectively eliminating vibration perturbations through multimodal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGastrointestinal Bleeding Diagnosis and Treatment
MethodsFocus
