Towards Full-parameter and Parameter-efficient Self-learning For   Endoscopic Camera Depth Estimation

Shuting Zhao; Chenkang Du; Kristin Qi; Xinrong Chen; and Xinhan Di

arXiv:2410.00979·cs.CV·October 11, 2024

Towards Full-parameter and Parameter-efficient Self-learning For Endoscopic Camera Depth Estimation

Shuting Zhao, Chenkang Du, Kristin Qi, Xinrong Chen, and Xinhan Di

PDF

Open Access

TL;DR

This paper introduces a full-parameter, parameter-efficient learning framework for endoscopic depth estimation, significantly improving performance over existing adaptation methods by optimizing multiple subspaces simultaneously.

Contribution

It proposes a novel two-stage framework that adapts attention, convolution, and MLP subspaces and introduces a memory-efficient optimization for better depth estimation.

Findings

01

Performance improved from 10.2% to 4.1% in key metrics.

02

Initial experiments show significant accuracy gains.

03

Framework outperforms state-of-the-art models.

Abstract

Adaptation methods are developed to adapt depth foundation models to endoscopic depth estimation recently. However, such approaches typically under-perform training since they limit the parameter search to a low-rank subspace and alter the training dynamics. Therefore, we propose a full-parameter and parameter-efficient learning framework for endoscopic depth estimation. At the first stage, the subspace of attention, convolution and multi-layer perception are adapted simultaneously within different sub-spaces. At the second stage, a memory-efficient optimization is proposed for subspace composition and the performance is further improved in the united sub-space. Initial experiments on the SCARED dataset demonstrate that results at the first stage improves the performance from 10.2% to 4.1% for Sq Rel, Abs Rel, RMSE and RMSE log in the comparison with the state-of-the-art models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsColor Science and Applications · Image Retrieval and Classification Techniques · Medical Image Segmentation Techniques

MethodsConvolution