Task-Aware Asynchronous Multi-Task Model with Class Incremental Contrastive Learning for Surgical Scene Understanding
Lalithkumar Seenivasan, Mobarakol Islam, Mengya Xu, Chwee Ming Lim and, Hongliang Ren

TL;DR
This paper introduces a task-aware asynchronous multi-task learning model with class incremental contrastive learning and curriculum learning to improve surgical scene understanding across domain shifts, enabling real-time tool-tissue interaction detection and report generation.
Contribution
It proposes a novel multi-task model with domain adaptation techniques, including CICL and curriculum learning, for surgical scene understanding, addressing domain shifts and computational efficiency.
Findings
Balanced performance on both tasks with BLEU score 0.4049 and accuracy 0.3508
Model adapts to domain shifts and novel instruments effectively
Performs on par with single-task models in domain adaptation
Abstract
Purpose: Surgery scene understanding with tool-tissue interaction recognition and automatic report generation can play an important role in intra-operative guidance, decision-making and postoperative analysis in robotic surgery. However, domain shifts between different surgeries with inter and intra-patient variation and novel instruments' appearance degrade the performance of model prediction. Moreover, it requires output from multiple models, which can be computationally expensive and affect real-time performance. Methodology: A multi-task learning (MTL) model is proposed for surgical report generation and tool-tissue interaction prediction that deals with domain shift problems. The model forms of shared feature extractor, mesh-transformer branch for captioning and graph attention branch for tool-tissue interaction prediction. The shared feature extractor employs class incremental…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Surgical Simulation and Training · Artificial Intelligence in Healthcare and Education
MethodsContrastive Learning
