4D Monocular Surgical Reconstruction under Arbitrary Camera Motions
Jiwei Shan, Zeyu Cai, Cheng-Tai Hsieh, Yirui Li, Hao Liu, Lijun Han, Hesheng Wang, Shing Shin Cheng

TL;DR
This paper introduces Local-EndoGS, a novel 4D reconstruction framework for monocular endoscopic videos with arbitrary camera motion, overcoming initialization challenges and improving scene reconstruction quality in clinical settings.
Contribution
The paper presents a scalable, window-based 4D reconstruction method that integrates multi-view geometry and monocular priors, enabling robust deformable scene reconstruction with large camera motions.
Findings
Outperforms state-of-the-art methods in appearance and geometry quality.
Effective in handling long sequences with substantial camera motion.
Validated on three public endoscopic datasets.
Abstract
Reconstructing deformable surgical scenes from endoscopic videos is challenging and clinically important. Recent state-of-the-art methods based on implicit neural representations or 3D Gaussian splatting have made notable progress. However, most are designed for deformable scenes with fixed endoscope viewpoints and rely on stereo depth priors or accurate structure-from-motion for initialization and optimization, limiting their ability to handle monocular sequences with large camera motion in real clinical settings. To address this, we propose Local-EndoGS, a high-quality 4D reconstruction framework for monocular endoscopic sequences with arbitrary camera motion. Local-EndoGS introduces a progressive, window-based global representation that allocates local deformable scene models to each observed window, enabling scalability to long sequences with substantial motion. To overcome…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · 3D Shape Modeling and Analysis · Computer Graphics and Visualization Techniques
