Multi-label Classification of Surgical Tools with Convolutional Neural   Networks

Jonas Prellberg; Oliver Kramer

arXiv:1805.05760·cs.CV·May 16, 2018

Multi-label Classification of Surgical Tools with Convolutional Neural Networks

Jonas Prellberg, Oliver Kramer

PDF

TL;DR

This paper develops a convolutional neural network-based system to automatically detect 21 different surgical tools in cataract surgery videos, addressing challenges like class imbalance for real-time surgical assistance.

Contribution

It introduces a residual network architecture tailored for multi-label surgical tool detection and explores various design choices through extensive experiments.

Findings

01

Achieved effective multi-label tool detection in cataract videos

02

Addressed class imbalance issues in surgical image datasets

03

Provided insights into optimal network design for surgical tool recognition

Abstract

Automatic tool detection from surgical imagery has a multitude of useful applications, such as real-time computer assistance for the surgeon. Using the successful residual network architecture, a system that can distinguish 21 different tools in cataract surgery videos is created. The videos are provided as part of the 2017 CATARACTS challenge and pose difficulties found in many real-world datasets, for example a strong class imbalance. The construction of the detection system is guided by a wide array of experiments that explore different design decisions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.