Investigating a Policy-Based Formulation for Endoscopic Camera Pose Recovery

Jan Emily Mangulabnan; Akshat Chauhan; Laura Fleig; Lalithkumar Seenivasan; Roger D. Soberanis-Mukul; S. Swaroop Vedula; Russell H. Taylor; Masaru Ishii; Gregory D. Hager; Mathias Unberath

arXiv:2603.20045·eess.IV·March 23, 2026

Investigating a Policy-Based Formulation for Endoscopic Camera Pose Recovery

Jan Emily Mangulabnan, Akshat Chauhan, Laura Fleig, Lalithkumar Seenivasan, Roger D. Soberanis-Mukul, S. Swaroop Vedula, Russell H. Taylor, Masaru Ishii, Gregory D. Hager, Mathias Unberath

PDF

Open Access

TL;DR

This paper proposes a policy-based approach for endoscopic camera pose recovery that predicts relative motions directly, addressing limitations of geometric methods under challenging conditions like low texture and illumination changes.

Contribution

It introduces a novel learned policy formulation that imitates expert reasoning, avoiding explicit geometric representations and improving robustness in endoscopic navigation.

Findings

01

Achieves lowest mean translation error compared to geometric baselines.

02

Demonstrates robustness under low-texture and illumination changes.

03

Offers a viable alternative to traditional geometry-based pose estimation.

Abstract

In endoscopic surgery, surgeons continuously locate the endoscopic view relative to the anatomy by interpreting the evolving visual appearance of the intraoperative scene in the context of their prior knowledge. Vision-based navigation systems seek to replicate this capability by recovering camera pose directly from endoscopic video, but most approaches do not embody the same principles of reasoning about new frames that makes surgeons successful. Instead, they remain grounded in feature matching and geometric optimization over keyframes, an approach that has been shown to degrade under the challenging conditions of endoscopic imaging like low texture and rapid illumination changes. Here, we pursue an alternative approach and investigate a policy-based formulation of endoscopic camera pose recovery that seeks to imitate experts in estimating trajectories conditioned on the previous…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSurgical Simulation and Training · Advanced Vision and Imaging · Robotics and Sensor-Based Localization