Assessing YOLACT++ for real time and robust instance segmentation of   medical instruments in endoscopic procedures

Juan Carlos Angeles Ceron; Leonardo Chang; Gilberto Ochoa-Ruiz and; Sharib Ali

arXiv:2103.15997·cs.CV·January 21, 2022

Assessing YOLACT++ for real time and robust instance segmentation of medical instruments in endoscopic procedures

Juan Carlos Angeles Ceron, Leonardo Chang, Gilberto Ochoa-Ruiz and, Sharib Ali

PDF

TL;DR

This paper enhances the YOLACT++ model with attention mechanisms to enable real-time, accurate instance segmentation of medical instruments in endoscopic procedures, addressing the need for clinical applicability.

Contribution

It introduces attention mechanisms to YOLACT++, achieving real-time segmentation with improved accuracy on medical instrument datasets, surpassing previous two-stage detector models.

Findings

01

Achieves 37 fps in real-time segmentation.

02

Matches robustness scores of top challenge winners.

03

Improves accuracy over baseline models.

Abstract

Image-based tracking of laparoscopic instruments plays a fundamental role in computer and robotic-assisted surgeries by aiding surgeons and increasing patient safety. Computer vision contests, such as the Robust Medical Instrument Segmentation (ROBUST-MIS) Challenge, seek to encourage the development of robust models for such purposes, providing large, diverse, and annotated datasets. To date, most of the existing models for instance segmentation of medical instruments were based on two-stage detectors, which provide robust results but are nowhere near to the real-time (5 frames-per-second (fps)at most). However, in order for the method to be clinically applicable, real-time capability is utmost required along with high accuracy. In this paper, we propose the addition of attention mechanisms to the YOLACT architecture that allows real-time instance segmentation of instrument with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.