MetaFE-DE: Learning Meta Feature Embedding for Depth Estimation from Monocular Endoscopic Images
Dawei Lu, Deqiang Xiao, Danni Ai, Jingfan Fan, Tianyu Fu, Yucong Lin,, Hong Song, Xujiong Ye, Lei Zhang, Jian Yang

TL;DR
This paper introduces MetaFE-DE, a novel self-supervised approach for monocular endoscopic depth estimation that leverages shared meta feature embeddings to improve accuracy and generalization over existing methods.
Contribution
The paper proposes a new meta feature embedding concept and a two-stage self-supervised learning paradigm utilizing diffusion models for improved depth estimation.
Findings
Outperforms state-of-the-art methods in accuracy
Demonstrates superior generalization across datasets
Validates effectiveness on diverse endoscopic data
Abstract
Depth estimation from monocular endoscopic images presents significant challenges due to the complexity of endoscopic surgery, such as irregular shapes of human soft tissues, as well as variations in lighting conditions. Existing methods primarily estimate the depth information from RGB images directly, and often surffer the limited interpretability and accuracy. Given that RGB and depth images are two views of the same endoscopic surgery scene, in this paper, we introduce a novel concept referred as ``meta feature embedding (MetaFE)", in which the physical entities (e.g., tissues and surgical instruments) of endoscopic surgery are represented using the shared features that can be alternatively decoded into RGB or depth image. With this concept, we propose a two-stage self-supervised learning paradigm for the monocular endoscopic depth estimation. In the first stage, we propose a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsColorectal Cancer Screening and Detection
MethodsDiffusion
