Weakly Supervised Convolutional LSTM Approach for Tool Tracking in Laparoscopic Videos
Chinedu Innocent Nwoye, Didier Mutter, Jacques Marescaux, Nicolas, Padoy

TL;DR
This paper introduces a weakly supervised deep learning method using CNN and ConvLSTM to track surgical tools in laparoscopic videos with only binary presence labels, outperforming baseline models in detection and localization.
Contribution
The novel approach leverages ConvLSTM to utilize spatio-temporal information for tool tracking without requiring detailed annotations, reducing data annotation effort.
Findings
Outperforms baseline in tool presence detection by over 5%.
Achieves 13.9% improvement in spatial localization.
Enhances motion tracking accuracy by 12.6%.
Abstract
Purpose: Real-time surgical tool tracking is a core component of the future intelligent operating room (OR), because it is highly instrumental to analyze and understand the surgical activities. Current methods for surgical tool tracking in videos need to be trained on data in which the spatial positions of the tools are manually annotated. Generating such training data is difficult and time-consuming. Instead, we propose to use solely binary presence annotations to train a tool tracker for laparoscopic videos. Methods: The proposed approach is composed of a CNN + Convolutional LSTM (ConvLSTM) neural network trained end-to-end, but weakly supervised on tool binary presence labels only. We use the ConvLSTM to model the temporal dependencies in the motion of the surgical tools and leverage its spatio-temporal ability to smooth the class peak activations in the localization heat maps…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurgical Simulation and Training · 3D Shape Modeling and Analysis · Augmented Reality Applications
Methods1-Dimensional Convolutional Neural Networks
