SURGIVID: Annotation-Efficient Surgical Video Object Discovery
\c{C}a\u{g}han K\"oksal, Ghazal Ghazaei, Nassir Navab

TL;DR
This paper introduces SURGIVID, an annotation-efficient framework for surgical video object discovery that combines self-supervised learning with minimal supervision, achieving performance comparable to fully-supervised methods.
Contribution
The paper presents a novel unsupervised approach for surgical scene segmentation that requires significantly fewer annotations and leverages surgical phase labels for improved accuracy.
Findings
Comparable localization performance with only 36 annotations
Surgical phase labels improve tool localization by ~2%
Effective discovery of relevant surgical objects with minimal supervision
Abstract
Surgical scenes convey crucial information about the quality of surgery. Pixel-wise localization of tools and anatomical structures is the first task towards deeper surgical analysis for microscopic or endoscopic surgical views. This is typically done via fully-supervised methods which are annotation greedy and in several cases, demanding medical expertise. Considering the profusion of surgical videos obtained through standardized surgical workflows, we propose an annotation-efficient framework for the semantic segmentation of surgical scenes. We employ image-based self-supervised object discovery to identify the most salient tools and anatomical structures in surgical videos. These proposals are further refined within a minimally supervised fine-tuning step. Our unsupervised setup reinforced with only 36 annotation labels indicates comparable localization performance with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsColorectal Cancer Screening and Detection · Image Retrieval and Classification Techniques
MethodsSoftmax · Attention Is All You Need
