Self-Supervised Depth Estimation in Laparoscopic Image using 3D   Geometric Consistency

Baoru Huang; Jian-Qing Zheng; Anh Nguyen; Chi Xu; Ioannis Gkouzionis,; Kunal Vyas; David Tuch; Stamatia Giannarou; Daniel S. Elson

arXiv:2208.08407·cs.CV·June 22, 2023

Self-Supervised Depth Estimation in Laparoscopic Image using 3D Geometric Consistency

Baoru Huang, Jian-Qing Zheng, Anh Nguyen, Chi Xu, Ioannis Gkouzionis,, Kunal Vyas, David Tuch, Stamatia Giannarou, Daniel S. Elson

PDF

Open Access 3 Repos

TL;DR

This paper introduces M3Depth, a self-supervised depth estimation method for laparoscopic images that leverages 3D geometric consistency in stereo pairs, improving accuracy over previous approaches without requiring ground truth depth.

Contribution

The paper proposes a novel self-supervised depth estimator that utilizes 3D geometric structural information in stereo images, enhancing depth accuracy in laparoscopic imaging.

Findings

01

Outperforms previous self-supervised methods on public and new datasets

02

Demonstrates strong generalization across different laparoscopes

03

Effectively handles border regions to improve correspondence accuracy

Abstract

Depth estimation is a crucial step for image-guided intervention in robotic surgery and laparoscopic imaging system. Since per-pixel depth ground truth is difficult to acquire for laparoscopic image data, it is rarely possible to apply supervised depth estimation to surgical applications. As an alternative, self-supervised methods have been introduced to train depth estimators using only synchronized stereo image pairs. However, most recent work focused on the left-right consistency in 2D and ignored valuable inherent 3D information on the object in real world coordinates, meaning that the left-right 3D geometric structural consistency is not fully utilized. To overcome this limitation, we present M3Depth, a self-supervised depth estimator to leverage 3D geometric structural information hidden in stereo pairs while keeping monocular inference. The method also removes the influence of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Robotics and Sensor-Based Localization