EndoSparse: Real-Time Sparse View Synthesis of Endoscopic Scenes using Gaussian Splatting
Chenxin Li, Brandon Y. Feng, Yifan Liu, Hengyu Liu, Cheng Wang, Weihao, Yu, Yixuan Yuan

TL;DR
EndoSparse introduces a novel framework that leverages foundation models to improve real-time 3D reconstruction and view synthesis of endoscopic scenes from sparse observations, enhancing geometric accuracy and rendering quality.
Contribution
It is the first to integrate foundation model priors into endoscopic 3D reconstruction, significantly improving results under sparse-view conditions.
Findings
Outperforms state-of-the-art methods in geometry and appearance quality.
Achieves robust 3D reconstruction with only three views.
Demonstrates real-time rendering efficiency.
Abstract
3D reconstruction of biological tissues from a collection of endoscopic images is a key to unlock various important downstream surgical applications with 3D capabilities. Existing methods employ various advanced neural rendering techniques for photorealistic view synthesis, but they often struggle to recover accurate 3D representations when only sparse observations are available, which is usually the case in real-world clinical scenarios. To tackle this {sparsity} challenge, we propose a framework leveraging the prior knowledge from multiple foundation models during the reconstruction process, dubbed as \textit{EndoSparse}. Experimental results indicate that our proposed strategy significantly improves the geometric and appearance quality under challenging sparse-view conditions, including using only three views. In rigorous benchmarking experiments against state-of-the-art methods,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques
