Rendezvous in Time: An Attention-based Temporal Fusion approach for   Surgical Triplet Recognition

Saurav Sharma; Chinedu Innocent Nwoye; Didier Mutter; Nicolas Padoy

arXiv:2211.16963·cs.CV·June 19, 2023

Rendezvous in Time: An Attention-based Temporal Fusion approach for Surgical Triplet Recognition

Saurav Sharma, Chinedu Innocent Nwoye, Didier Mutter, Nicolas Padoy

PDF

Open Access 1 Repo

TL;DR

This paper introduces RiT, a deep learning model that enhances surgical triplet recognition by incorporating temporal attention to better understand the evolution of actions in surgical videos.

Contribution

The paper presents a novel temporal attention-based fusion method extending existing models for improved surgical triplet recognition.

Findings

01

Improved recognition accuracy for verbs and triplets.

02

Smoother and more consistent triplet predictions.

03

Effective temporal modeling of surgical actions.

Abstract

One of the recent advances in surgical AI is the recognition of surgical activities as triplets of (instrument, verb, target). Albeit providing detailed information for computer-assisted intervention, current triplet recognition approaches rely only on single frame features. Exploiting the temporal cues from earlier frames would improve the recognition of surgical action triplets from videos. In this paper, we propose Rendezvous in Time (RiT) - a deep learning model that extends the state-of-the-art model, Rendezvous, with temporal modeling. Focusing more on the verbs, our RiT explores the connectedness of current and past frames to learn temporal attention-based features for enhanced triplet recognition. We validate our proposal on the challenging surgical triplet dataset, CholecT45, demonstrating an improved recognition of the verb and triplet along with other interactions involving…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

camma-public/rendezvous-in-time
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSurgical Simulation and Training · Artificial Intelligence in Healthcare and Education · Cardiac, Anesthesia and Surgical Outcomes