TL;DR
This paper introduces a semi-synthetic data generation method for training surgical instrument segmentation models without manual annotations, leveraging chroma key techniques and novel augmentation to achieve comparable performance to manually labeled datasets.
Contribution
The authors propose a semi-synthetic dataset creation approach using chroma key and innovative blending methods, eliminating the need for manual pixel-wise annotations in surgical tool segmentation.
Findings
Semi-synthetic data training matches real dataset performance.
Simple post-processing improves segmentation accuracy.
Method reduces manual labeling effort significantly.
Abstract
Producing manual, pixel-accurate, image segmentation labels is tedious and time-consuming. This is often a rate-limiting factor when large amounts of labeled images are required, such as for training deep convolutional networks for instrument-background segmentation in surgical scenes. No large datasets comparable to industry standards in the computer vision community are available for this task. To circumvent this problem, we propose to automate the creation of a realistic training dataset by exploiting techniques stemming from special effects and harnessing them to target training performance rather than visual appeal. Foreground data is captured by placing sample surgical instruments over a chroma key (a.k.a. green screen) in a controlled environment, thereby making extraction of the relevant image segment straightforward. Multiple lighting conditions and viewpoints can be captured…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsConvolution · *Communicated@Fast*How Do I Communicate to Expedia? · Concatenated Skip Connection · Max Pooling · U-Net
