SoftEnNet: Symbiotic Monocular Depth Estimation and Lumen Segmentation for Colonoscopy Endorobots
Alwyn Mathew, Ludovic Magerand, Emanuele Trucco, Luigi Manfredi

TL;DR
This paper introduces SoftEnNet, a multi-task deep learning model that jointly estimates depth and segments the lumen in colonoscopy images, enhancing autonomous navigation and screening accuracy.
Contribution
The novel multi-task model integrates depth estimation and lumen segmentation with mutual learning, addressing challenges unique to colonoscopy imaging.
Findings
Accurately predicts scale-invariant depth maps.
Provides precise lumen segmentation in real-time.
Addresses deformable wall and textureless surface challenges.
Abstract
Colorectal cancer is the third most common cause of cancer death worldwide. Optical colonoscopy is the gold standard for detecting colorectal cancer; however, about 25 percent of polyps are missed during the procedure. A vision-based autonomous endorobot can improve colonoscopy procedures significantly through systematic, complete screening of the colonic mucosa. The reliable robot navigation needed requires a three-dimensional understanding of the environment and lumen tracking to support autonomous tasks. We propose a novel multi-task model that simultaneously predicts dense depth and lumen segmentation with an ensemble of deep networks. The depth estimation sub-network is trained in a self-supervised fashion guided by view synthesis; the lumen segmentation sub-network is supervised. The two sub-networks are interconnected with pathways that enable information exchange and thereby…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Vision and Imaging · Robotics and Sensor-Based Localization
