DD-VNB: A Depth-based Dual-Loop Framework for Real-time Visually Navigated Bronchoscopy
Qingyao Tian, Huai Liao, Xinyan Huang, Jian Chen, Zihui Zhang, Bingyu, Yang, Sebastien Ourselin, Hongbin Liu

TL;DR
This paper introduces DD-VNB, a depth-based dual-loop framework for real-time bronchoscopic navigation that generalizes across patients without retraining, achieving high accuracy and speed in localization.
Contribution
The study presents a novel depth estimation and localization framework that combines view synthesis, cycle adversarial architecture, and fast ego-motion estimation for real-time bronchoscopic navigation.
Findings
Depth estimation outperforms state-of-the-art methods.
Localization accuracy with ATE of 4.7 mm in phantom and 6.49 mm in patient data.
Achieves near video frame-rate processing without retraining.
Abstract
Real-time 6 DOF localization of bronchoscopes is crucial for enhancing intervention quality. However, current vision-based technologies struggle to balance between generalization to unseen data and computational speed. In this study, we propose a Depth-based Dual-Loop framework for real-time Visually Navigated Bronchoscopy (DD-VNB) that can generalize across patient cases without the need of re-training. The DD-VNB framework integrates two key modules: depth estimation and dual-loop localization. To address the domain gap among patients, we propose a knowledge-embedded depth estimation network that maps endoscope frames to depth, ensuring generalization by eliminating patient-specific textures. The network embeds view synthesis knowledge into a cycle adversarial architecture for scale-constrained monocular depth estimation. For real-time performance, our localization module embeds a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLung Cancer Diagnosis and Treatment · Colorectal Cancer Screening and Detection · AI in cancer detection
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
