DARES: Depth Anything in Robotic Endoscopic Surgery with Self-supervised Vector-LoRA of the Foundation Model
Mona Sheikh Zeinoddin, Chiara Lena, Jiongqi Qu, Luca Carlini, Mattia, Magro, Seunghoi Kim, Elena De Momi, Sophia Bano, Matthew Grech-Sollars,, Evangelos Mazomenos, Daniel C. Alexander, Danail Stoyanov, Matthew J., Clarkson, Mobarakol Islam

TL;DR
This paper introduces DARES, a novel self-supervised depth estimation method for robotic endoscopic surgery that adapts foundation models using Vector-LoRA, improving accuracy and robustness in surgical scenes.
Contribution
The paper proposes Vector-LoRA, a new adaptation technique that enhances foundation models for depth estimation in surgical environments, addressing overfitting and feature hierarchy issues.
Findings
Achieves 13.3% improvement in absolute relative error on SCARED dataset.
Demonstrates superior performance over recent state-of-the-art methods.
Validates effectiveness of Vector-LoRA in surgical depth estimation.
Abstract
Robotic-assisted surgery (RAS) relies on accurate depth estimation for 3D reconstruction and visualization. While foundation models like Depth Anything Models (DAM) show promise, directly applying them to surgery often yields suboptimal results. Fully fine-tuning on limited surgical data can cause overfitting and catastrophic forgetting, compromising model robustness and generalization. Although Low-Rank Adaptation (LoRA) addresses some adaptation issues, its uniform parameter distribution neglects the inherent feature hierarchy, where earlier layers, learning more general features, require more parameters than later ones. To tackle this issue, we introduce Depth Anything in Robotic Endoscopic Surgery (DARES), a novel approach that employs a new adaptation technique, Vector Low-Rank Adaptation (Vector-LoRA) on the DAM V2 to perform self-supervised monocular depth estimation in RAS…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsColorectal Cancer Screening and Detection · Minimally Invasive Surgical Techniques · Surgical Simulation and Training
