TL;DR
ISINet introduces an instance-based segmentation approach with temporal consistency for surgical instruments, significantly improving accuracy on benchmark datasets in robotic-assisted surgery scenes.
Contribution
The paper presents ISINet, a novel instance-based segmentation network that incorporates temporal information, outperforming existing methods in surgical instrument segmentation.
Findings
ISINet doubles the IoU of previous methods.
Complete ISINet model triples the IoU.
Outperforms state-of-the-art on benchmark datasets.
Abstract
We study the task of semantic segmentation of surgical instruments in robotic-assisted surgery scenes. We propose the Instance-based Surgical Instrument Segmentation Network (ISINet), a method that addresses this task from an instance-based segmentation perspective. Our method includes a temporal consistency module that takes into account the previously overlooked and inherent temporal information of the problem. We validate our approach on the existing benchmark for the task, the Endoscopic Vision 2017 Robotic Instrument Segmentation Dataset, and on the 2018 version of the dataset, whose annotations we extended for the fine-grained version of instrument segmentation. Our results show that ISINet significantly outperforms state-of-the-art methods, with our baseline version duplicating the Intersection over Union (IoU) of previous methods and our complete model triplicating the IoU.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
