Self-Supervised Depth Estimation in Laparoscopic Image using 3D Geometric Consistency
Baoru Huang, Jian-Qing Zheng, Anh Nguyen, Chi Xu, Ioannis Gkouzionis,, Kunal Vyas, David Tuch, Stamatia Giannarou, Daniel S. Elson

TL;DR
This paper introduces M3Depth, a self-supervised depth estimation method for laparoscopic images that leverages 3D geometric consistency in stereo pairs, improving accuracy over previous approaches without requiring ground truth depth.
Contribution
The paper proposes a novel self-supervised depth estimator that utilizes 3D geometric structural information in stereo images, enhancing depth accuracy in laparoscopic imaging.
Findings
Outperforms previous self-supervised methods on public and new datasets
Demonstrates strong generalization across different laparoscopes
Effectively handles border regions to improve correspondence accuracy
Abstract
Depth estimation is a crucial step for image-guided intervention in robotic surgery and laparoscopic imaging system. Since per-pixel depth ground truth is difficult to acquire for laparoscopic image data, it is rarely possible to apply supervised depth estimation to surgical applications. As an alternative, self-supervised methods have been introduced to train depth estimators using only synchronized stereo image pairs. However, most recent work focused on the left-right consistency in 2D and ignored valuable inherent 3D information on the object in real world coordinates, meaning that the left-right 3D geometric structural consistency is not fully utilized. To overcome this limitation, we present M3Depth, a self-supervised depth estimator to leverage 3D geometric structural information hidden in stereo pairs while keeping monocular inference. The method also removes the influence of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Robotics and Sensor-Based Localization
