Training-free Temporal Object Tracking in Surgical Videos
Subhadeep Koley, Abdolrahim Kadkhodamohammadi, Santiago Barbarisi, Danail Stoyanov, Imanol Luengo

TL;DR
This paper introduces a training-free method for real-time object tracking in surgical videos using pre-trained diffusion models, achieving high accuracy without additional training.
Contribution
It leverages pre-trained text-to-image diffusion models for object localization in surgical videos, eliminating the need for costly annotations and training.
Findings
Diffusion features outperform traditional methods in localization accuracy.
The approach achieves 79.19% pixel classification accuracy.
It demonstrates superior performance over existing trackers on surgical datasets.
Abstract
Purpose: In this paper, we present a novel approach for online object tracking in laparoscopic cholecystectomy (LC) surgical videos, targeting localisation and tracking of critical anatomical structures and instruments. Our method addresses the challenges of costly pixel-level annotations and label inconsistencies inherent in existing datasets. Methods: Leveraging the inherent object localisation capabilities of pre-trained text-to-image diffusion models, we extract representative features from surgical frames without any training or fine-tuning. Our tracking framework uses these features, along with cross-frame interactions via an affinity matrix inspired by query-key-value attention, to ensure temporal continuity in the tracking process. Results: Through a pilot study, we first demonstrate that diffusion features exhibit superior object localisation and consistent semantics across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurgical Simulation and Training · Face recognition and analysis · Advanced Neural Network Applications
