A Kinematic Bottleneck Approach For Pose Regression of Flexible Surgical Instruments directly from Images

Luca Sestini, Benoit Rosa, Elena De Momi, Giancarlo Ferrigno, and Nicolas Padoy

arXiv:2103.00586 · cs.RO · March 2, 2021

TL;DR

This paper introduces a self-supervised, real-time image-based method for 3-D pose estimation of flexible surgical instruments, leveraging a kinematic bottleneck and physical model to avoid manual annotations.

Contribution

The paper presents a novel auto-encoder framework in which a kinematic bottleneck and a physical model make self-supervised training possible, enabling real-time pose estimation without manual labels.
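
As a rough illustration of what a physically meaningful bottleneck could look like, the sketch below maps a three-value kinematic vector to a 3-D tip position. It assumes a single constant-curvature bending segment with a bending angle `theta`, a bending-plane angle `phi`, and an insertion depth `d`. This parameterization, the function name `tip_position`, and the `length` parameter are illustrative assumptions, not details taken from the paper.

```python
import math


def tip_position(theta, phi, d, length=1.0):
    """Toy physical model (hypothetical): kinematic vector -> 3-D tip position.

    Assumes one constant-curvature segment of arc length `length`,
    bent by `theta` radians in the plane at angle `phi` around the z axis,
    and translated by `d` along the z axis. Not the paper's model.
    """
    # A straight segment has no finite bending radius, so handle it separately.
    if abs(theta) < 1e-9:
        return (0.0, 0.0, length + d)
    radius = length / theta                       # bending radius of the arc
    reach = radius * (1.0 - math.cos(theta))      # lateral offset of the tip
    return (
        reach * math.cos(phi),
        reach * math.sin(phi),
        radius * math.sin(theta) + d,
    )
```

In an auto-encoder of this kind, an encoder network would output values such as `theta`, `phi`, and `d` at the bottleneck, and a model like the one above would turn them into a pose. Because each bottleneck value has a physical meaning, the pose can be read off without pose labels.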

Findings

1. Validated on semi-synthetic, phantom, and in-vivo datasets

2. Achieved promising real-time 3-D pose estimation results

3. Demonstrated effectiveness for flexible robotic endoscopes

Abstract

3-D pose estimation of instruments is a crucial step towards automatic scene understanding in robotic minimally invasive surgery. Although robotic systems can potentially directly provide joint values, this information is not commonly exploited inside the operating room, due to its possible unreliability, limited access and the time-consuming calibration required, especially for continuum robots. For this reason, standard approaches for 3-D pose estimation involve the use of external tracking systems. Recently, image-based methods have emerged as promising, non-invasive alternatives. While many image-based approaches in the literature have shown accurate results, they generally require either a complex iterative optimization for each processed image, making them unsuitable for real-time applications, or a large number of manually-annotated images for efficient learning. In this paper we…
