A Kinematic Bottleneck Approach For Pose Regression of Flexible Surgical Instruments directly from Images
Luca Sestini, Benoit Rosa, Elena De Momi, Giancarlo Ferrigno and, Nicolas Padoy

TL;DR
This paper introduces a self-supervised, real-time image-based method for 3-D pose estimation of flexible surgical instruments, leveraging a kinematic bottleneck and physical model to avoid manual annotations.
Contribution
It presents a novel auto-encoder framework that uses a kinematic bottleneck and physical model for self-supervised training, enabling real-time pose estimation without manual labels.
Findings
Validated on semi-synthetic, phantom, and in-vivo datasets
Achieved promising real-time 3-D pose estimation results
Demonstrated effectiveness for flexible robotic endoscopes
Abstract
3-D pose estimation of instruments is a crucial step towards automatic scene understanding in robotic minimally invasive surgery. Although robotic systems can potentially directly provide joint values, this information is not commonly exploited inside the operating room, due to its possible unreliability, limited access and the time-consuming calibration required, especially for continuum robots. For this reason, standard approaches for 3-D pose estimation involve the use of external tracking systems. Recently, image-based methods have emerged as promising, non-invasive alternatives. While many image-based approaches in the literature have shown accurate results, they generally require either a complex iterative optimization for each processed image, making them unsuitable for real-time applications, or a large number of manually-annotated images for efficient learning. In this paper we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
