Training-free Detection and 6D Pose Estimation of Unseen Surgical Instruments
Jonas Hein, Lilian Calvet, Matthias Seibold, Siyu Tang, Marc Pollefeys, Philipp F\"urnstahl

TL;DR
This paper introduces a training-free, multi-view pipeline for accurate 6D pose estimation of unseen surgical instruments using only a CAD model, combining detection, geometric filtering, and contour refinement.
Contribution
It presents a novel training-free approach that leverages foundational models and multi-view geometry for marker-less, high-accuracy pose estimation of unseen surgical tools.
Findings
Achieves millimeter-accurate pose estimates comparable to supervised methods.
Demonstrates full generalization to unseen surgical instruments.
Enables robust, marker-less instrument tracking in surgical scenes.
Abstract
Purpose: Accurate detection and 6D pose estimation of surgical instruments are crucial for many computer-assisted interventions. However, supervised methods lack flexibility for new or unseen tools and require extensive annotated data. This work introduces a training-free pipeline for accurate multi-view 6D pose estimation of unseen surgical instruments, which only requires a textured CAD model as prior knowledge. Methods: Our pipeline consists of two main stages. First, for detection, we generate object mask proposals in each view and score their similarity to rendered templates using a pre-trained feature extractor. Detections are matched across views, triangulated into 3D instance candidates, and filtered using multi-view geometric consistency. Second, for pose estimation, a set of pose hypotheses is iteratively refined and scored using feature-metric scores with cross-view…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurgical Simulation and Training · Soft Robotics and Applications · Augmented Reality Applications
