TL;DR
This paper introduces an advanced simulation environment for generating realistic surgical instrument images to improve 6D pose estimation, crucial for automation in surgical robotics, by creating large annotated datasets with high accuracy.
Contribution
We developed an automated data generation pipeline and enhanced surgical scene simulation to produce realistic datasets for surgical instrument pose estimation.
Findings
Generated 7.5k annotated images for surgical needle pose estimation.
Achieved a mean translational error of 2.59mm on occluded datasets.
Demonstrated the pipeline's effectiveness for training vision algorithms.
Abstract
Automation in surgical robotics has the potential to improve patient safety and surgical efficiency, but it is difficult to achieve due to the need for robust perception algorithms. In particular, 6D pose estimation of surgical instruments is critical to enable the automatic execution of surgical maneuvers based on visual feedback. In recent years, supervised deep learning algorithms have shown increasingly better performance at 6D pose estimation tasks; yet, their success depends on the availability of large amounts of annotated data. In household and industrial settings, synthetic data, generated with 3D computer graphics software, has been shown as an alternative to minimize annotation costs of 6D pose datasets. However, this strategy does not translate well to surgical domains as commercial graphics software have limited tools to generate images depicting realistic instrument-tissue…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
